Augmented reality communication method, apparatus, and system

ABSTRACT

An augmented reality (AR) communication method in which AR is integrated into voice and video calls is disclosed. An auxiliary data channel and a media stream channel are established between a terminal device and an AR media processing network element. In addition, the AR auxiliary data and the media stream share the same transmission path. The AR media processing network element may be an IMS access gateway having an AR processing capability or a media resource function (MRF).

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No. PCT/CN2022/083191, filed on Mar. 25, 2022, which claims priority to Chinese Patent Application No. 202110329252.9, filed on Mar. 27, 2021. The disclosures of the aforementioned applications are hereby incorporated by reference in their entireties.

TECHNICAL FIELD

Embodiments of this application relate to the field of communication technologies, and in particular, to an augmented reality communication method, apparatus, and system.

BACKGROUND

Constructed on a 4th generation (4G) network, voice over long term evolution (VoLTE) is an end-to-end voice solution under an all-IP condition. During communication between users, VoLTE offers shorter call connection time and higher voice and video call quality. Augmented reality (AR) is a technology that cleverly integrates virtual information with the real world. A plurality of technical means, such as multimedia, three-dimensional modeling, real-time tracking and registration, intelligent interaction, and sensing, are widely used to simulate virtual information such as a text, an image, a three-dimensional model, music, and a video that are generated by a computer. The simulated information is then applied to the real world, and the two types of information complement each other, to implement “augmentation” of the real world. The augmented reality technology can not only effectively reflect content of the real world, but also display the virtual information content. Currently, there is no effective implementation in which AR is integrated into voice and video calls.

SUMMARY

Embodiments of this application provide an augmented reality communication method, apparatus, and system, to provide an implementation in which AR is integrated into voice and video calls, thereby improving user experience.

According to a first aspect, an embodiment of this application provides an augmented reality AR communication system, including an internet protocol multimedia subsystem IMS core network element and a first AR media processing network element. The IMS core network element is configured to control establishment of a first media stream channel between the first AR media processing network element and a first terminal device and of an auxiliary data channel between the first AR media processing network element and the first terminal device. The first AR media processing network element is configured to: receive first AR auxiliary data from the first terminal device through the established auxiliary data channel, and receive a first media stream for AR communication between the first terminal device and a second terminal device through the established media stream channel. The first AR media processing network element is further configured to: perform augmented processing on the first media stream based on the first AR auxiliary data to obtain a first AR media stream, and send the first AR media stream to the first terminal device through the media stream channel. According to the foregoing solution, the auxiliary data channel and the media stream channel are established between the terminal device and the AR media processing network element, so that AR is effectively integrated into voice and video calls. In addition, the AR auxiliary data and the media stream share the same transmission path, so that latency can be reduced.

In a possible design, the first AR media processing network element may be an IMS access gateway having an AR processing capability or a media resource function MRF. For example, when point-to-point communication is performed between the terminal devices, the first AR media processing network element may be an IMS access gateway. For another example, when the terminal device performs an AR conference, the first AR media processing network element may be the MRF.

In a possible design, the first AR media processing network element is located in an IMS network to which the first terminal device belongs (or in which the first terminal device is located). The AR media processing network element in the IMS network in which the first terminal device is located has the AR processing capability, and the AR media processing network element in the IMS network in which the first terminal device is located performs augmented processing according to a proximity principle, so that latency can be further reduced.

In a possible design, the system further includes a second AR media processing network element located in an IMS network to which the second terminal device belongs (or in which the second terminal device is located). The first AR media processing network element is further configured to send the AR media stream to the second AR media processing network element. The second AR media processing network element is configured to: receive the AR media stream sent by the first AR media processing network element, and send the AR media stream to the second terminal device.

In a possible design, the first AR media processing network element is located in an IMS network to which the second terminal device belongs (or in which the second terminal device is located), the system further includes a third AR media processing network element located in an IMS network to which the first terminal device belongs (or in which the first terminal device is located), the third AR media processing network element does not have the AR processing capability, and the media stream channel between the first AR media processing network element and the first terminal device includes a first media stream channel between the first AR media processing network element and the third AR media processing network element and a second media stream channel between the third AR media processing network element and the first terminal device.

The third AR media processing network element is configured to: receive the media stream from the first terminal device through the second media stream channel, and forward the media stream to the first AR media processing network element through the first media stream channel; and receive an AR media stream from the first AR media processing network element through the first media stream channel, and send the AR media stream to the first terminal device through the second media stream channel.

In the foregoing design, when the AR media processing network element in the IMS network in which the first terminal device is located does not have the AR processing capability, the AR media processing network element in the IMS network in which the second terminal device is located is configured to provide an augmented processing function for the first terminal device. A feasible solution is provided for integrating AR into voice and video calls.

In a possible design, the first AR media processing network element is further configured to receive target object information from the first terminal device through the auxiliary data channel. The first AR media processing network element is further configured to obtain, from a third-party server, a target object corresponding to the target object information.

In the foregoing design, the third-party server may provide a material of the target object for the AR communication between the terminal devices. In this application, the target object is obtained from the third-party server through the auxiliary data channel.

In a possible design, the target object is a virtual object or an identifier of a virtual object. The first AR media processing network element is further configured to send the virtual object or the identifier of the virtual object to the first terminal device through the auxiliary data channel. Because the virtual object is required in a plurality of scenarios, the virtual object is obtained from the third-party server through the auxiliary data channel in the foregoing design.

In a possible design, the target object information includes a virtual object type requested by the first terminal device, and the first AR media processing network element sends a plurality of virtual object identifiers of the virtual object type to the first terminal device through the auxiliary data channel.

In a possible design, after receiving the plurality of virtual object identifiers corresponding to the virtual objects, the first AR media processing network element stores correspondences between the virtual object types and the plurality of virtual object identifiers.

In a possible design, when receiving the virtual object type through the auxiliary data channel, the first AR media processing network element may first determine whether the first AR media processing network element stores the virtual object identifier corresponding to the virtual object type. If yes, the first AR media processing network element does not need to obtain, from the third-party server, the virtual object identifier corresponding to the virtual object type. If the first AR media processing network element does not store the virtual object identifier corresponding to the virtual object type, the first AR media processing network element obtains, from the third-party server, the virtual object identifier corresponding to the virtual object type. In this manner, unnecessary resource consumption caused by repeated obtaining can be reduced.
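
The check-before-fetch behavior described above can be sketched as follows. This is an illustrative sketch only, written in Python for clarity; the class name, the injected third_party_client, and its fetch_identifiers method are assumptions for illustration and are not defined by this application.

    # Illustrative sketch of the cache-then-fetch behavior described above.
    # third_party_client and fetch_identifiers are hypothetical names.
    class ArMediaProcessingElement:
        def __init__(self, third_party_client):
            self.third_party_client = third_party_client
            # virtual object type -> list of virtual object identifiers
            self.identifier_cache = {}

        def get_virtual_object_identifiers(self, object_type):
            # Reuse locally stored identifiers when available, avoiding an
            # unnecessary request to the third-party server.
            if object_type in self.identifier_cache:
                return self.identifier_cache[object_type]
            identifiers = self.third_party_client.fetch_identifiers(object_type)
            self.identifier_cache[object_type] = identifiers
            return identifiers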

In a possible design, the method further includes:

The first AR media processing network element receives a first virtual object identifier from the first terminal device through the auxiliary data channel, where the first virtual object identifier is for identifying a first virtual object selected by a user of the first terminal device from a plurality of virtual objects. The first AR media processing network element obtains the first virtual object corresponding to the first virtual object identifier from the third-party server, and sends the first virtual object to the first terminal device through the auxiliary data channel.

In a possible design, the target object is a real background object, and the first AR media processing network element is further configured to receive, through a media stream channel, a second media stream that is generated during the AR communication and that is sent by the first terminal device.

The first AR media processing network element is further configured to: render and synthesize the real background object and the second media stream to obtain a second AR media stream; and send the second AR media stream to the first terminal device through the media stream channel. In some application scenarios, a background picture corresponding to a foreground object is fixed. In this scenario, a rendering operation between the media stream and the background picture is performed by the AR media processing network element, so that pressure on uplink bandwidth of the terminal device can be reduced.

In a possible design, the system further includes an application server.

The application server is configured to: receive a first request message from the AR media processing network element, and forward the first request message to the third-party server; and receive a first response message from the third-party server, and forward the first response message to the first AR media processing network element.

In a possible design, the application server is configured with an address of the third-party server; and the application server is specifically configured to: forward the first request message to the third-party server based on the address of the third-party server, and forward the first response message to the first AR media processing network element.

In a possible design, the application server is further configured to send an address of the third-party server to the first AR media processing network element in a procedure of establishing the auxiliary data channel between the first terminal device and the first AR media processing network element.

According to a second aspect, an embodiment of this application provides an augmented reality AR communication method, including: An internet protocol multimedia subsystem IMS core network element receives a first request message from a first terminal device, where the first request message is for requesting establishment of an auxiliary data channel, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and an AR media processing network element. The IMS core network element controls establishment of the auxiliary data channel between the AR media processing network element and the first terminal device. According to the foregoing solution, a simple and effective manner of creating the auxiliary data channel is provided, to obtain the AR auxiliary data for the AR communication through the auxiliary data channel, and perform augmented processing on a media stream for the AR communication during the AR communication.

In a possible design, that the IMS core network element controls establishment of the auxiliary data channel between the AR media processing network element and the first terminal device includes: The IMS core network element sends a second request message to the AR media processing network element, where the second request message is for requesting establishment of the auxiliary data channel between the AR media processing network element and the first terminal device. The IMS core network element receives a second response message sent by the AR media processing network element, where the second response message indicates to the IMS core network element that the AR media processing network element agrees to the establishment of the auxiliary data channel. The IMS core network element sends a first response message to the first terminal device, where the first response message indicates to the first terminal device that the AR media processing network element is ready for establishment of the auxiliary data channel.

In a possible design, the AR media processing network element is an IMS access gateway having an AR processing capability or a media resource function MRF. For example, when point-to-point communication is performed between the terminal devices, the AR media processing network element may be an IMS access gateway. For another example, when an AR conference is performed between the terminal devices, the AR media processing network element may be an MRF.

In a possible design, the IMS core network element controls establishment of a media stream channel between the AR media processing network element and the first terminal device. According to the foregoing solution, the auxiliary data channel and the media stream channel are established between the first terminal device and the AR media processing network element, so that AR is effectively integrated into voice and video calls. In addition, the AR auxiliary data and the media stream share the same transmission path, so that latency can be reduced.

In a possible design, the AR media processing network element is located in an IMS network to which the first terminal device belongs or is located in an IMS network to which a second terminal device belongs. The AR media processing network element in the IMS network in which the first terminal device is located has an AR processing capability, and the AR media processing network element in the IMS network in which the first terminal device is located performs augmented processing according to a proximity principle, so that latency can be further reduced. When the AR media processing network element in the IMS network in which the first terminal device is located does not have the AR processing capability, the AR media processing network element in the IMS network in which the second terminal device is located is configured to provide an augmented processing function for the first terminal device. A feasible solution is provided for integrating AR into voice and video calls.

In a possible design, the first request message and the second request message carry a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel.

The first response message and the second response message carry a second description parameter that is of the AR media processing network element and that is for establishing the auxiliary data channel.

In a possible design, the method further includes: The IMS core network element sends a query request to the AR media processing network element, where the query request is for querying whether the AR media processing network element has an AR processing capability. The IMS core network element receives a query response sent by the AR media processing network element, where the query response carries information indicating that the AR media processing network element has the AR processing capability. In the foregoing design, a feasible and effective manner of determining whether the AR media processing network element has the AR processing capability is provided.

In a possible design, the AR media processing network element is located in an IMS network to which a second terminal device belongs, and the method further includes:

Before sending a query message to the AR media processing network element located in the IMS network to which the second terminal device belongs, the IMS core network element determines that the AR media processing network element that provides a service for the first terminal device does not have the AR processing capability.

In a possible design, a procedure of establishing AR communication between the first terminal device and the second terminal device is a procedure of establishing an AR session between the first terminal device and the second terminal device, a procedure of updating an AR session between the first terminal device and the second terminal device, or a procedure of re-establishing an AR session between the first terminal device and the second terminal device.

In a possible design, the first request message is a session initiation protocol SIP INVITE request message, a SIP re-establishment request message, or a SIP update request message.

In a possible design, the method further includes: The IMS core network element controls establishment of a media stream channel between the AR media processing network element and the first terminal device, where the media stream channel is for transmitting content for AR communication between the first terminal device and the AR media processing network element.

In a possible design, a third request message from the first terminal device is received, where the third request message is further for requesting establishment of the media stream channel. That the IMS core network element controls establishment of a media stream channel between the AR media processing network element and the first terminal device includes: The IMS core network element sends a fourth request message to the AR media processing network element, where the fourth request message is for requesting establishment of the media stream channel between the AR media processing network element and the first terminal device. The IMS core network element receives a fourth response message sent by the AR media processing network element, where the fourth response message indicates to the IMS core network element that the AR media processing network element agrees to the establishment of the media stream channel. The IMS core network element sends a third response message to the first terminal device, where the third response message indicates to the first terminal device that the AR media processing network element is ready for establishment of the media stream channel.

It is to be noted that the auxiliary data channel and the media stream channel may be established in a same procedure; or the media stream channel may be established first, and the auxiliary data channel is then established.

For example, the first request message and the third request message may be carried in a same message, the second request message and the fourth request message may be carried in a same message, the first response message and the third response message may be carried in a same message, and the second response message and the fourth response message may be carried in a same message.

According to a third aspect, an embodiment of this application provides an augmented reality AR communication method, including:

An auxiliary data channel is established between an AR media processing network element and a first terminal device, triggered by the first terminal device.

In a possible design, a media stream channel for transmitting content for AR communication is established between the AR media processing network element and the first terminal device.

In a possible design, an AR media processing network element receives a second request message sent by an internet protocol multimedia subsystem IMS core network element, where the second request message is for requesting establishment of an auxiliary data channel between the AR media processing network element and a first terminal device, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and the AR media processing network element. The AR media processing network element sends a second response message to the IMS core network element, where the second response message indicates to the IMS core network element that the AR media processing network element agrees to the establishment of the auxiliary data channel.

In a possible design, the method further includes: The AR media processing network element receives AR auxiliary data from the first terminal device through the auxiliary data channel after the establishment of the auxiliary data channel is completed.

In a possible design, the method further includes: The AR media processing network element receives a media stream for the AR communication from the first terminal device through the media stream channel. The AR media processing network element performs media augmented processing on the media stream for the AR communication based on the AR auxiliary data to obtain an AR media stream. The AR media processing network element sends the AR media stream obtained through augmented processing to the first terminal device through the media stream channel. According to the foregoing solution, the auxiliary data channel and the media stream channel are established between the terminal device and the AR media processing network element, so that AR is effectively integrated into voice and video calls. In addition, the AR auxiliary data and the media stream share the same transmission path, so that latency can be reduced.
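
The per-session handling described in this design can be illustrated with the following Python sketch. The channel objects and the render_frame function are hypothetical placeholders used for illustration, not an API defined by this application.

    # Sketch of the augmented-processing loop described above. The channel
    # objects and render_frame are hypothetical placeholders.
    def handle_ar_session(aux_data_channel, media_stream_channel, render_frame):
        latest_aux_data = None
        while True:
            # AR auxiliary data (for example, pose or annotation updates)
            # arrives on the auxiliary data channel.
            update = aux_data_channel.poll()
            if update is not None:
                latest_aux_data = update

            frame = media_stream_channel.receive()
            if frame is None:
                break  # session ended

            # Augmented processing: combine the received media frame with the
            # most recent AR auxiliary data to obtain an AR media frame.
            ar_frame = render_frame(frame, latest_aux_data)

            # Return the AR media stream to the first terminal device over the
            # same media stream channel.
            media_stream_channel.send(ar_frame)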

In a possible design, the method further includes: The AR media processing network element sends the AR media stream obtained through augmented processing to a second terminal device.

In a possible design, the AR auxiliary data includes one or more of operation data of a virtual model, annotation or tracking data of a target object, identification data of a facial expression or a body action, or pose data.

In a possible design, the AR media processing network element is an IMS access gateway having an AR processing capability or a media resource function MRF.

In a possible design, the AR media processing network element is located in an IMS network to which the first terminal device belongs (or in which the first terminal device is located) or is located in an IMS network to which the second terminal device belongs (or in which the second terminal device is located).

In a possible design, the second request message carries a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel, and the method further includes: The AR media processing network element determines, from the first description parameter, a second description parameter supported by the AR media processing network element, where the second response message carries the second description parameter.
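
As an illustration of how the second description parameter could be derived from the first, consider the following sketch; the field names and the sets of supported values are assumptions made for illustration and are not parameters mandated by this application.

    # Sketch: derive the second description parameter as the subset of the
    # offered first description parameter that this network element supports.
    # Field names and supported values are illustrative assumptions.
    SUPPORTED_SUBPROTOCOLS = {"", "http"}
    SUPPORTED_FORMATS = {"text/plain", "application/json"}

    def derive_second_description(offer: dict) -> dict:
        return {
            "subprotocol": offer.get("subprotocol", "")
            if offer.get("subprotocol", "") in SUPPORTED_SUBPROTOCOLS else "",
            "accept_types": [f for f in offer.get("accept_types", [])
                             if f in SUPPORTED_FORMATS],
            "ordered": offer.get("ordered", True),
            "max_retr": min(offer.get("max_retr", 0), 2),
        }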

According to a fourth aspect, an embodiment of this application provides an augmented reality AR communication method, including: A first terminal device sends a first request message to an IMS core network element, where the first request message is for requesting establishment of an auxiliary data channel. The first terminal device receives a first response message sent by the IMS core network element, where the first response message indicates that an AR media processing network element completes the establishment of the auxiliary data channel, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and the AR media processing network element.

In a possible design, the method further includes: establishing, between the first terminal device and the AR media processing network element, a media stream channel for transmitting content for the AR communication.

In a possible design, that a media stream channel for transmitting content for the AR communication is established between the first terminal device and the AR media processing network element includes:

The first terminal device sends a third request message to the IMS core network element, where the third request message is for requesting establishment of the media stream channel.

The first terminal device receives a third response message sent by the IMS core network element, where the third response message indicates to the first terminal device that the AR media processing network element is ready for establishment of the media stream channel.

In a possible design, the first request message is further for requesting establishment of a media stream channel. The first response message further indicates that the AR media processing network element is ready for establishment of the media stream channel.

In a possible design, the method further includes:

The first terminal device sends augmented reality AR auxiliary data to the AR media processing network element through the auxiliary data channel, and sends a media stream to the AR media processing network element through the media stream channel, where the media stream is generated during AR communication between the first terminal device and a second terminal device, and the AR auxiliary data is used by the AR media processing network element to perform media augmented processing on the media stream for the AR communication.

The first terminal device receives an AR media stream obtained through augmented processing from the AR media processing network element through the media stream channel.

In a possible design, the method further includes:

The first terminal device receives, through the media stream channel, an AR media stream that is of the second terminal device and that is sent by the AR media processing network element.

In a possible design, the first request message carries a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel.

The first response message carries a second description parameter that is of the AR media processing network element and that is for establishing the auxiliary data channel.

In a possible design, the AR media processing network element is located in an IMS network to which the first terminal device belongs (or in which the first terminal device is located) or is located in an IMS network to which the second terminal device belongs (or in which the second terminal device is located).

According to a fifth aspect, an embodiment of this application further provides an apparatus. The apparatus is used in an IMS core network element. For beneficial effects, refer to the descriptions in the second aspect. Details are not described herein again. The apparatus has a function of implementing the behavior in the method example in the second aspect. The function may be implemented by hardware, or may be implemented by hardware executing corresponding software. The hardware or the software includes one or more modules corresponding to the foregoing function. In a possible design, a structure of the apparatus may include a sending unit and a receiving unit. The units may perform corresponding functions in the method example in the second aspect. For details, refer to the detailed descriptions in the method example. Details are not described herein again.

According to a sixth aspect, an embodiment of this application further provides an apparatus. The apparatus is used in an AR media processing network element. For beneficial effects, refer to the descriptions in the third aspect. Details are not described herein again. The apparatus has a function of implementing the behavior in the method example in the third aspect. The function may be implemented by hardware, or may be implemented by hardware executing corresponding software. The hardware or the software includes one or more modules corresponding to the foregoing function. In a possible design, a structure of the apparatus may include a sending unit, a receiving unit, and a processing unit. The units may perform corresponding functions in the method example in the third aspect. For details, refer to the detailed descriptions in the method example. Details are not described herein again.

According to a seventh aspect, an embodiment of this application further provides an apparatus. The apparatus is used in a terminal device. For beneficial effects, refer to the descriptions in the fourth aspect. Details are not described herein again. The apparatus has a function of implementing the behavior in the method example in the fourth aspect. The function may be implemented by hardware, or may be implemented by hardware executing corresponding software. The hardware or the software includes one or more modules corresponding to the foregoing function. In a possible design, a structure of the apparatus includes a sending unit and a receiving unit. The units may perform corresponding functions in the method example in the fourth aspect. For details, refer to the detailed descriptions in the method example. Details are not described herein again.

According to an eighth aspect, an embodiment of this application further provides an apparatus. The apparatus is used in an IMS core network element. For beneficial effects, refer to the descriptions in the second aspect. Details are not described herein again. A structure of the communication apparatus includes a processor and a communication interface, and may further include a memory. The processor is configured to support the IMS core network element in performing a corresponding function in the method in the second aspect through the communication interface. The memory is coupled to the processor, and stores program instructions and data that are necessary for the communication apparatus.

According to a ninth aspect, an embodiment of this application further provides an apparatus. The apparatus is used in an AR media processing network element. For beneficial effects, refer to the descriptions in the third aspect. Details are not described herein again. A structure of the apparatus includes a processor and a communication interface, and may further include a memory. The processor is configured to support the AR media processing network element in performing a corresponding function in the method in the third aspect through the communication interface. The memory is coupled to the processor and stores program instructions and data that are necessary for the apparatus.

According to a tenth aspect, an embodiment of this application further provides an apparatus. The apparatus is used in a terminal device. For beneficial effects, refer to the descriptions in the fourth aspect. Details are not described herein again. A structure of the apparatus includes a processor and a communication interface, and may further include a memory. The processor is configured to support the terminal device in performing a corresponding function in the method in the fourth aspect through the communication interface. The memory is coupled to the processor and stores program instructions and data that are necessary for the apparatus.

According to an eleventh aspect, an embodiment of this application further provides an apparatus, and the apparatus is used in an application server. A structure of the communication apparatus includes a processor and a communication interface, and may further include a memory. The processor is configured to support the application server in performing a corresponding function of the application server in any one of the foregoing aspects through the communication interface. The memory is coupled to the processor, and stores program instructions and data that are necessary for the communication apparatus.

According to a twelfth aspect, this application further provides a computer-readable storage medium. The computer-readable storage medium stores instructions. When the instructions are run on a computer, the computer is enabled to perform the methods according to the foregoing aspects.

According to a thirteenth aspect, this application further provides a computer program product including instructions. When the computer program product runs on a computer, the computer is enabled to perform the methods according to the foregoing aspects.

According to a fourteenth aspect, this application further provides a computer chip. The chip is connected to a memory. The chip is configured to read and execute a software program stored in the memory, to perform the methods according to the foregoing aspects.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of an architecture of a possible AR communication system according to an embodiment of this application;

FIG. 2A is a schematic diagram of an architecture of another possible AR communication system according to an embodiment of this application;

FIG. 2B is a schematic diagram of an architecture of still another possible AR communication system according to an embodiment of this application;

FIG. 2C is a schematic diagram of an architecture of yet another possible AR communication system according to an embodiment of this application;

FIG. 3 is a schematic diagram of an architecture of another possible AR communication system according to an embodiment of this application;

FIG. 4 is a schematic diagram of an architecture of another possible AR communication system according to an embodiment of this application;

FIG. 5 is a schematic diagram of an architecture of another possible AR communication system according to an embodiment of this application;

FIG. 6 is a schematic flowchart of an AR communication method according to an embodiment of this application;

FIG. 7A is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 7B is a schematic flowchart of still another AR communication method according to an embodiment of this application;

FIG. 8A-1 and FIG. 8A-2 are a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 8B is a schematic diagram of a termination T1 and a termination T2 on an IMS AGW according to an embodiment of this application;

FIG. 9 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 10A and FIG. 10B are a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 11 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 12A is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 12B is a schematic flowchart of still another AR communication method according to an embodiment of this application;

FIG. 12C is a schematic flowchart of yet another AR communication method according to an embodiment of this application;

FIG. 13 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 14 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 15 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 16 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 17 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 18 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 19 is a schematic flowchart of another AR communication method according to an embodiment of this application;

FIG. 20 is a schematic diagram of a structure of an apparatus according to an embodiment of this application;

FIG. 21 is a schematic diagram of a structure of another apparatus according to an embodiment of this application;

FIG. 22 is a schematic diagram of a structure of another apparatus according to an embodiment of this application; and

FIG. 23 is a schematic diagram of a structure of another apparatus according to an embodiment of this application.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

This application provides an AR communication method and apparatus, to provide an implementation in which AR is integrated into voice and video calls, thereby improving user experience. The voice and video calls may be, but are not limited to, VoLTE, and may also be applicable to voice and video calls provided by a future technology. The AR has three characteristics: virtual-reality integration, real-time interaction, and three-dimensional registration. The virtual-reality integration is to superimpose a virtual object and information that are generated by a computer onto a real-world scene, to understand the real scene more intuitively and deeply. Augmented information may be non-geometric information related to a real object, for example, a video and a text; or may be geometric information, for example, a virtual three-dimensional object and scene. The real-time interaction is real-time interaction between people and an augmented reality environment in a natural manner via an interaction interface device in an augmented reality system. The “registration” in the three-dimensional registration may be interpreted as tracking and locating. The three-dimensional registration is that a virtual object generated by a computer is in a one-to-one correspondence with a real environment, and a correct alignment relationship is continuously maintained when a user moves in the real environment.

The term “and/or” in this application describes only an association relationship between associated objects and represents that three relationships may exist. For example, A and/or B may represent the following three cases: only A exists, both A and B exist, and only B exists. In addition, the character “/” in this specification usually indicates an “or” relationship between the associated objects. The term “at least one” in this application means one or more, that is, includes one, two, three, or more; and the term “a plurality of” means two or more, that is, includes two, three, or more. In addition, it can be understood that in the descriptions of this application, terms such as “first” and “second” are merely used for distinguishing and description, and cannot be understood as indicating or implying relative importance or as indicating or implying a sequence. The term “at least one of the following items (pieces)” or a similar expression thereof indicates any combination of these items, including a single item (piece) or any combination of a plurality of items (pieces). For example, at least one item (piece) of a, b, or c may indicate: a, b, c, a and b, a and c, b and c, or a, b, and c, where a, b, and c may be singular or plural. It can be understood that in embodiments of this application, “B corresponding to A” indicates that B is associated with A, and B may be determined according to A. However, it can be further understood that determining B based on A does not mean that B is determined based only on A; B may alternatively be determined based on A and/or other information. In addition, the terms “include” and “have” in embodiments, claims, and accompanying drawings of this application are not exclusive. For example, a process, method, system, product, or device including a series of steps or modules is not limited to the enumerated steps or modules, and may further include a step or module that is not enumerated.

The following describes in detail the solutions provided in embodiments of this application with reference to the accompanying drawings.

FIG. 1 is a schematic diagram of an architecture of an AR communication system according to an embodiment of this application. The AR communication system includes an AR media processing network element and an AR control network element. The AR media processing network element provides a media processing service for a terminal device that has an AR capability. The AR media processing network element has an AR user plane (AR User plane, AR-U) function, and may also be referred to as an AR-U network element. The AR control network element has an AR control plane function, and may be referred to as an AR control plane (AR Control plane, AR-C) network element. In this embodiment, a network element may be referred to as a device. For example, the AR media processing network element may be referred to as an AR media processing device.

The terminal device that has the AR capability may have one or more of the following logical functions: supporting acquisition of audio and video data in a real scene via a camera and/or a sensor, supporting a logical operation function (for example, a visual engine may be deployed), supporting low-latency encoding and lightweight image rendering, supporting obtaining of a virtual object from a cloud, and the like. The terminal device in this embodiment of this application may be a device configured with a camera and having a video call function. For example, the terminal device may be a wearable device (for example, an electronic watch), or the terminal device may be a device such as a mobile phone or a tablet computer. A specific form of the terminal device is not specifically limited in this embodiment of this application.

The AR media processing network element has an AR media processing function. For example, the AR media processing function includes one or more of three-dimensional modeling, object identification, tracking and locating, interaction perception, virtual-scene rendering, virtual-reality integration, and the like. The three-dimensional modeling is to perform three-dimensional construction on a real environment based on data acquired by a terminal device. The object identification is to identify an object of interest to a terminal device user. The tracking and locating is to track and locate an object of interest to a terminal device user. The interaction perception is to implement consistency of a plurality of senses by using a user interaction technology. The virtual-scene rendering is to render a virtual scene. The virtual-reality integration is to integrate a virtual object with a real object.

The AR media processing network element supports establishment of a media stream channel and an auxiliary data channel with the terminal device. The media stream channel is for transmitting, between the terminal device and the AR media processing network element, AR core data (or referred to as media stream data, which may also be referred to as a media stream for short), for example, an audio and video stream. The auxiliary data channel is for transmitting AR auxiliary data between the terminal device and the AR media processing network element. The AR auxiliary data (or AR specific data) is a general name of non-media data transmitted between the terminal device and the AR media processing network element.

In an example, as shown in Table 1, a data type that may be included in AR auxiliary data in an uplink direction and a video data type that is included in AR core data are described as an example.

TABLE 1

Data type         | Data subtype         | Description
AR auxiliary data | Operation type       | Operation of a 2D/3D model
AR auxiliary data | Identification type  | Identification of a facial expression/body movement
AR auxiliary data | Labeling type        | Labeling/Tracking of a target object
AR auxiliary data | Pose type            | Pose data acquired by a camera
AR auxiliary data | Tactility type       | Tactile data acquired by a sensor
AR core data      | Real-scene video     | Real-scene video shot by the terminal
AR core data      | Real-scene logic     | The terminal completes tracking and three-dimensional reconstruction of the real-scene video, and obtains the corresponding real-scene logic
AR core data      | Location information | The terminal only needs to upload the location information related to the real scene

In an example, as shown in Table 2, a data type that may be included in AR auxiliary data in a downlink direction and a video data type that is included in the audio and video streams are described as an example.

TABLE 2

Data type         | Data subtype        | Description
AR auxiliary data | Model type          | In response to a request from a device side, support 2D/3D model download
AR core data      | Virtual scene       | The virtual scene rendered by the cloud, including a virtual scene, a virtual role, a lighting special effect, and the like, is delivered to the terminal, and the terminal completes virtual-reality integration based on three-dimensional reconstruction logic
AR core data      | Virtual-scene logic | The cloud only needs to complete logical calculation of the virtual scene based on location tracking and three-dimensional reconstruction technologies and deliver the virtual-scene logic to the terminal, and the terminal completes virtual-scene rendering and virtual-reality integration based on the virtual-scene logic

It can be understood that Table 1 and Table 2 are merely used as an example, and do not constitute a specific limitation on the AR auxiliary data and the AR core data.
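
For illustration only, the uplink AR auxiliary data subtypes in Table 1 could be modeled and framed as follows before being sent over the auxiliary data channel; the enumeration names and the JSON envelope are assumptions and are not specified by Table 1 or Table 2.

    # Sketch of modeling the uplink AR auxiliary data subtypes of Table 1.
    # The enum names and the JSON envelope are illustrative assumptions.
    import json
    from enum import Enum

    class ArAuxDataType(Enum):
        OPERATION = "operation"            # operation of a 2D/3D model
        IDENTIFICATION = "identification"  # facial expression / body movement
        LABELING = "labeling"              # labeling/tracking of a target object
        POSE = "pose"                      # pose data acquired by a camera
        TACTILITY = "tactility"            # tactile data acquired by a sensor

    def frame_aux_data(data_type, payload):
        # Tag the payload with its subtype so the AR media processing network
        # element can dispatch pose updates, model operations, and so on.
        return json.dumps({"type": data_type.value, "payload": payload})

    # Example: a pose update sent over the auxiliary data channel.
    message = frame_aux_data(ArAuxDataType.POSE, {"x": 0.1, "y": 0.2, "z": 1.5})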

The AR control network element has functions of session association and service control. In an AR session scenario, the session association may include an association between the media stream between the terminal device and the AR media processing network element and a signaling stream between the AR media processing network element and an AS; or an association among the media stream between the terminal device and the AR media processing network element, a signaling stream between the AR media processing network element and an AS, and a message stream between the AS and a third-party server. The third-party server is configured to provide a material required for AR processing, such as a virtual object and a real background object. The service control may be to control AR media processing based on a service feature or the like. For example, different service features correspond to different AR media processing manners, and the AR control network element may indicate different AR media processing manners to the AR media processing network element based on different service features of the terminal device.

In a possible implementation, the AR media processing network element may be an IP multimedia subsystem (IMS) access gateway (AGW) or a media resource function (MRF). The AR control network element may be an application server.

As an example, as shown in FIG. 2A, an AR media processing network element is an IMS AGW, and an AR communication system includes the IMS AGW and an IMS core. The IMS core controls establishment of an auxiliary data channel and a media stream channel between the IMS AGW having an AR processing capability and a terminal device that has an AR capability. The auxiliary data channel is for transmitting AR auxiliary data for AR communication between at least two terminal devices. The media stream channel is for transmitting a media stream for AR communication between the at least two terminal devices. The IMS AGW may provide a media processing service for the terminal device. For example, when receiving the media stream through the media stream channel and the AR auxiliary data through the auxiliary data channel, the IMS AGW may perform augmented processing on the media stream by using the AR auxiliary data.

The AR communication system may further include an application server (AS). The AR communication system may further include at least two terminal devices. IMS AGWs respectively serving different terminal devices may be the same or different. In FIG. 2A, for example, the two terminal devices are respectively a first terminal device and a second terminal device, and IMS AGWs serving the first terminal device and the second terminal device are different, and are respectively an IMS AGW 1 and an IMS AGW 2. The two terminal devices in AR communication may both have the AR capability; or one terminal device may have the AR capability, and the other terminal device does not have the AR capability. In FIG. 2A, for example, both the first terminal device and the second terminal device have the AR capability. For example, different terminal devices access different IMS networks, and the IMS AGWs serving the different terminal devices are different. Different IMS AGWs serving the two terminal devices in AR communication may both have the AR processing capability; or only one IMS AGW has the AR processing capability, and the other IMS AGW does not have the AR processing capability. In FIG. 2A, both the IMS AGW 1 and the IMS AGW 2 have the AR processing capability. The IMS AGW 1 serves the first terminal device, or the IMS AGW 1 is an access gateway of the first terminal device, or the IMS AGW 1 is located in an IMS network to which the first terminal device belongs (or in which the first terminal device is located). The IMS AGW 2 serves the second terminal device. In other words, the IMS AGW 2 is an access gateway of the second terminal device, or the IMS AGW 2 is located in an IMS network to which the second terminal device belongs. One access gateway may serve a plurality of terminal devices. For example, the IMS AGW 1 serves the first terminal device, and further serves a third terminal device (not shown in FIG. 2A). The IMS core may include a call session control function (CSCF) and/or a home subscriber server (HSS). The IMS core may further include another network element. Details are not described in this embodiment of this application. The CSCF is a call control center of the IMS core, and implements functions such as user access, authentication, session routing, and service triggering on an IP transmission platform. The CSCF may include one or more of a serving-call session control function (S-CSCF), a proxy-CSCF (P-CSCF), and an interrogating-CSCF (I-CSCF). The HSS is configured to record subscription data (such as user information and service data) of a user. It is to be noted that an IMS core network element that provides a service for the first terminal device may be different from an IMS core network element that provides a service for the second terminal device, which, for example, is related to a location of the terminal device. As an example, the IMS core network element mentioned in this embodiment of this application may include the CSCF.

In some embodiments, the AR communication system may further include a third-party server. The third-party server may be understood as a server of a third-party provider. The third-party server is configured to provide a target object required by the terminal device to perform AR communication, for example, a virtual object or, for another example, a real background object. The third-party server and the application server are each configured with address information of the other, which, for example, may include an IP address and a port number. In a possible example, refer to FIG. 2B. The third-party server supports establishing a communication connection to the AR media processing network element, which, for example, may be a hypertext transfer protocol (HTTP) connection. In another possible example, the third-party server does not support establishing a communication connection to the AR media processing network element. As shown in FIG. 2C, the third-party server instead establishes a communication connection to the application server, for example, an HTTP connection.

As another example, one of the two terminal devices in AR communication does not have the AR capability. For example, as shown in FIG. 3, a first terminal device has an AR capability, and a second terminal device does not have the AR capability. In FIG. 3, an IMS AGW 1 serving the first terminal device has the AR capability.

As still another example, one of the two terminal devices in AR communication does not have the AR capability. For example, as shown in FIG. 4, a first terminal device has an AR capability, and a second terminal device does not have the AR capability. In FIG. 4, an IMS AGW 2 serving the second terminal device has an AR capability, and an IMS AGW 1 serving the first terminal device does not have the AR capability.

In another possible implementation, an AR media processing network element may be a media resource function (MRF). As an example, as shown in FIG. 5, the AR media processing network element is an MRF, and the AR communication system includes the MRF, an IMS core, an application server (AS), and a third-party server. The MRF is usually classified into a media resource function controller (MRFC) and a media resource function processor (MRFP). In a conference scenario, a terminal device communicates with another terminal device via the MRF. The MRF is a central node between a session message and a media stream. In FIG. 5, a first terminal device, a second terminal device, and a third terminal device are used as examples. A plurality of terminal devices in an AR conference may all have an AR capability; or one or more of the terminal devices may have an AR capability, and the other terminal devices do not have the AR capability. For example, in FIG. 5, both the first terminal device and the second terminal device have the AR capability, and the third terminal device does not have the AR capability. MRFs respectively serving a plurality of terminal devices may be the same or different. In FIG. 5, the MRFs serving the first terminal device and the third terminal device are a same MRF, namely, an MRF 1, and the MRF serving the second terminal device is an MRF 2. Different MRFs serving a plurality of terminal devices that perform an AR conference may all have an AR processing capability; or only one or more of the MRFs have an AR processing capability, and the other MRFs do not have the AR processing capability. Both the MRF 1 and the MRF 2 in FIG. 5 have the AR processing capability. The MRF 1 serves the first terminal device and the third terminal device, and the MRF 2 serves the second terminal device.

Optionally, in this embodiment of this application, the terminal device may establish different auxiliary data channels based on types of transmitted AR auxiliary data. For example, when pose data needs to be transmitted, an auxiliary data channel 1 is established, and when operation data needs to be transmitted, an auxiliary data channel 2 is established. The terminal device may alternatively transmit different types of AR auxiliary data through one auxiliary data channel.
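
A short sketch of these two options follows; the open_data_channel helper and the channel labels are assumptions used only to illustrate the choice between one channel per data type and a single shared channel.

    # Sketch: open one auxiliary data channel per AR auxiliary data type, or
    # multiplex all types over a single channel. open_data_channel and the
    # labels are illustrative assumptions.
    def open_aux_channels(session, one_channel_per_type=True):
        data_types = ["pose", "operation", "annotation", "identification"]
        if one_channel_per_type:
            # One auxiliary data channel per AR auxiliary data type.
            return {t: session.open_data_channel(label=t) for t in data_types}
        # A single auxiliary data channel shared by all types; each message
        # must then carry its data type so the receiver can dispatch it.
        shared = session.open_data_channel(label="AR-specific data")
        return {t: shared for t in data_types}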

An example in which the first terminal device and the second terminal device perform AR communication is used below, to describe a procedure of establishing an auxiliary data channel between the first terminal device and the AR media processing network element after the first terminal device and the second terminal device trigger AR communication. For details, refer to FIG. 6.

601: The first terminal device sends a first request message to an IMS core network element, where the first request message is for requesting establishment of an auxiliary data channel.

The first request message may carry a first description parameter of the first terminal device. The first description parameter is used by the first terminal device to establish the auxiliary data channel with an AR media processing network element. Optionally, the first request message further carries an address of the first terminal device.

For example, the first description parameter may use the session description protocol (SDP). Certainly, the first description parameter may alternatively use another description protocol. This is not specifically limited in this application. In the following descriptions, an example in which the first description parameter uses the SDP is used. For ease of differentiation, SDP information carrying the first description parameter is referred to as first SDP information (which may be referred to as a first SDP for short). In other words, the first request message carries the first SDP information of the first terminal device. The first SDP information includes the description parameter for establishing the auxiliary data channel between the first terminal device and the AR media processing network element. For example, the first SDP information may include one or more of parameters such as a port number used by the first terminal device to transmit AR auxiliary data, a media stream type, and a supported codec format. The media stream type may include video, audio, and data channel (AR auxiliary data). For example, in this case, the first SDP information may include an m line for describing the AR auxiliary data.

In an example, the m line is described as follows.

-   -   m=application 10001 user datagram protocol (UDP)/datagram transport layer security (DTLS)/stream control transmission protocol (SCTP) webrtc-datachannel.

The m line represents a description of a specific AR auxiliary data stream. 10001 may represent a port number of a quintuple, UDP/DTLS/SCTP represents a transmission protocol that the AR auxiliary data complies with, and webrtc-datachannel represents an established auxiliary data channel. The first SDP information may further include other information, for example, an a line for describing AR auxiliary data. In an example, the a line is described as follows.

-   -   a=dcmap:0 subprotocol=""; ordered=true; max-retr=2; label="AR-specific data", where
    -   dcmap represents a parameter set that needs to be negotiated at the two ends of the auxiliary data channel, where, for example, whether the data channel supports reliable transmission, whether a maximum quantity of retransmission times is set, and a sub-protocol format transferred in the data channel need to be negotiated. As shown in the foregoing example, 0 represents an auxiliary data channel whose ID is 0, and subprotocol represents a protocol type of the AR auxiliary data transmitted through the auxiliary data channel whose ID is 0, where, for example, subprotocol="" indicates that no protocol type is specified at the two ends, and subprotocol="http" indicates that the AR auxiliary data transmitted inside the auxiliary data channel is in an http format. ordered indicates whether reliable and ordered transmission is performed, max-retr represents a maximum quantity of retransmission times, where, for example, max-retr=2 indicates that the maximum quantity of retransmission times is 2, and label represents a description of the AR auxiliary data transmitted through the auxiliary data channel.

In another example, the first SDP information may further include the following.

-   -   a=dcsa:0 accept-types:text/plain, where
    -   dcsa represents an SDP attribute of the auxiliary data channel, 0 represents an ID of a specific auxiliary data channel, and accept-types represents a format of the AR auxiliary data transmitted through the auxiliary data channel. accept-types:text/plain indicates that the AR auxiliary data is in a text format.
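For concreteness, the following minimal Python sketch assembles the data-channel portion of such first SDP information (the m=application line together with the dcmap and dcsa attributes described above). The port number, channel ID, and label are illustrative values only.

    def build_datachannel_sdp(port: int = 10001, channel_id: int = 0) -> str:
        """Return the data-channel lines of the first SDP information."""
        lines = [
            # m line: AR auxiliary data carried over UDP/DTLS/SCTP as a WebRTC data channel
            f"m=application {port} UDP/DTLS/SCTP webrtc-datachannel",
            # dcmap: parameter set negotiated by the two ends of the auxiliary data channel
            f'a=dcmap:{channel_id} subprotocol=""; ordered=true; max-retr=2; label="AR-specific data"',
            # dcsa: SDP attribute giving the format of the transmitted AR auxiliary data
            f"a=dcsa:{channel_id} accept-types:text/plain",
        ]
        return "\r\n".join(lines) + "\r\n"

    print(build_datachannel_sdp())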

In an example, the first terminal device may send the first request message to the IMS core network element in a procedure of establishing AR communication with the second terminal device. For example, the first request message may be a session initiation protocol (SIP) invite message. The first request message may further carry address information of the first terminal device. For example, the address information of the first terminal device may include an IP address and/or a port number of the first terminal device.

Optionally, the first request message is further for requesting establishment of a media stream channel, where the media stream channel is for transmitting a media stream for performing AR communication between the first terminal device and the second terminal device. The first request message further carries a description parameter for establishing the media stream channel between the first terminal device and the AR media processing network element. For example, the first SDP information further includes a description parameter for establishing the media stream channel between the first terminal device and the AR media processing network element.

In another example, the first terminal device may send a first request message to the IMS core network element in a procedure of re-establishing an AR session with the second terminal device. For example, the first request message may be a SIP Re-invite (re-invite) message.

In still another example, the first terminal device may send a first request message to the IMS core network element in a procedure of updating an AR session with the second terminal device. For example, the first request message may be a SIP update invite (update invite) message.

602: After receiving the first request message, the IMS core network element sends a second request message to the AR media processing network element. The second request message is for requesting establishment of the auxiliary data channel between the AR media processing network element and the first terminal device. For example, the second request message further carries the first SDP information.

In this embodiment, the communication between the first terminal device and the second terminal device may be point-to-point communication between the first terminal device and the second terminal device, or may be conference communication established between the first terminal device and the second terminal device. The AR media processing network element is an IMS AGW having an AR processing capability or an MRF. For example, when point-to-point communication is performed between the first terminal device and the second terminal device, the AR media processing network element may be the IMS AGW. When AR conference communication is performed between the first terminal device and the second terminal device, the AR media processing network element may be the MRF. The AR media processing network element may be located in an IMS network to which the first terminal device belongs (or in which the first terminal device is located), and is configured to provide a service for the first terminal device; or the AR media processing network element may be located in an IMS network to which the second terminal device belongs (or in which the second terminal device is located), and is configured to provide a service for the second terminal device.
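A minimal sketch of the selection rule described above follows: point-to-point AR communication is anchored on an IMS AGW having an AR processing capability, and AR conference communication is anchored on an MRF. The function and string names are illustrative.

    def select_ar_media_element(call_type: str) -> str:
        """Return the kind of AR media processing network element for a call type."""
        if call_type == "point_to_point":
            return "IMS AGW"   # IMS access gateway having an AR processing capability
        if call_type == "conference":
            return "MRF"       # media resource function
        raise ValueError(f"unknown call type: {call_type}")

    print(select_ar_media_element("point_to_point"))  # IMS AGW
    print(select_ar_media_element("conference"))      # MRF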

For example, when the AR media processing network element is the IMS AGW, the second request message may be an H248 protocol message, for example, an H248 add request message. For another example, when the AR media processing network element is the MRF, the second request message may be a SIP invite message.

603: When receiving the second request message, the AR media processing network element sends a second response message to the IMS core network element, where the second response message is a response message corresponding to the second request message. For example, the second response message indicates to the IMS core network element that the AR media processing network element agrees to the establishment of the auxiliary data channel. For example, the second response message carries a second description parameter that is of the AR media processing network element and that is for establishing the auxiliary data channel. The AR media processing network element may determine, from the first description parameter, the second description parameter supported by the AR media processing network element. The second description parameter may use the SDP, or may use another protocol. For ease of differentiation, the SDP information of the AR media processing network element is referred to as second SDP information (which may be referred to as a second SDP for short). The second SDP information includes a description parameter that is of the AR media processing network element and that is for establishing the auxiliary data channel with the first terminal device. For example, the second SDP information includes parameters such as a port of the auxiliary data channel corresponding to the AR media processing network element, a media stream type, and a supported codec format. The media stream type may include a video (video stream), an audio (audio stream), and a data channel (AR auxiliary data). For example, in this case, the second SDP information may include an m line for describing the AR auxiliary data obtained by the AR media processing network element through negotiation, and a media stream type of the m line is the AR auxiliary data.
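The following minimal Python sketch illustrates, under simplifying assumptions, how the AR media processing network element might derive the second SDP information from the first SDP information: it keeps the offered data-channel m line and attributes it supports and fills in its own port. Real SDP negotiation is considerably more involved; the parsing here is intentionally naive.

    def build_second_sdp(first_sdp: str, local_port: int) -> str:
        """Derive an answer that keeps the offered data-channel description."""
        answer = []
        for line in first_sdp.splitlines():
            if line.startswith("m=application") and "webrtc-datachannel" in line:
                fields = line.split()
                fields[1] = str(local_port)      # answer with the element's own port
                answer.append(" ".join(fields))
            elif line.startswith(("a=dcmap:", "a=dcsa:")):
                answer.append(line)              # accept the offered channel attributes
        return "\r\n".join(answer) + "\r\n"

    offer = (
        "m=application 10001 UDP/DTLS/SCTP webrtc-datachannel\r\n"
        'a=dcmap:0 subprotocol=""; ordered=true; max-retr=2; label="AR-specific data"\r\n'
    )
    print(build_second_sdp(offer, local_port=20001))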

604: When receiving the second response message, the IMS core network element sends a first response message to the first terminal device, where the first response message carries an address of the AR media processing network element. The first response message carries the second SDP information of the AR media processing network element.

In some embodiments, after receiving the first request message, the IMS core network element forwards the first request message to an application server. When determining that the first terminal device has subscribed to an AR service, the application server may send address information of the application server to the AR media processing network element via the IMS core network element. The application server may add the address information of the application server to the first request message and send the first request message to the IMS core network element. In this way, the IMS core network element includes the address information of the application server in the second request message sent to the AR media processing network element, so that when the AR media processing network element subsequently needs to obtain content provided by a third-party server, the AR media processing network element may obtain the needed content from the third-party server via the AS.

For example, the application server may send a subscription query request to an HSS, to query whether subscription information of the first terminal device includes information about the subscribed AR service. The HSS may feed back the subscription information of the first terminal device to the application server by using a subscription query response. The subscription query request may be a diameter request message. The subscription query response may be a diameter response message.

In some other embodiments, after receiving the first request message, the IMS core network element forwards the first request message to the application server. When determining that the first terminal device has subscribed to the AR service, the application server may send the stored address information of the third-party server to the AR media processing network element via the IMS core network element. The application server may add the address information of the third-party server to the first request message and send the first request message to the IMS core network element. In this way, the IMS core network element includes the address information of the third-party server in the second request message sent to the AR media processing network element, so that when needing to obtain content provided by the third-party server subsequently, the AR media processing network element may obtain the needed content from the third-party server.

In a possible implementation, the terminal device may learn in advance whether the AR media processing network element has the AR processing capability. For example, in a procedure of registering with a network side, the terminal device learns whether an AR media processing network element having the AR processing capability exists in the IMS network in which the terminal device is located.

In another possible implementation, before sending the second request message to the AR media processing network element, the IMS core network element first queries whether the AR media processing network element has the AR processing capability, for example, sends a query message to the AR media processing network element. The query message is for querying whether the AR media processing network element has the AR processing capability. After receiving the query message, the AR media processing network element sends indication information to the IMS core network element, to indicate whether the AR media processing network element has the AR processing capability. When it is determined that the AR media processing network element has the AR processing capability, the second request message is then sent to the AR media processing network element having the AR processing capability.

The AR media processing network element may be an IMS AGW having an AR processing capability or an MRF. For example, the AR media processing network element is the IMS AGW. Refer to FIG. 7A and FIG. 7B. For ease of differentiation, an IMS AGW that provides a service for a first terminal device is referred to as an IMS AGW 1, and an IMS AGW that provides a service for a second terminal device is referred to as an IMS AGW 2.

701 a: Refer to 601. Details are not described herein again.

702 a: When receiving a first request message of the first terminal device, the IMS core network element sends a query message 1 to the IMS AGW 1, where the query message 1 is for querying whether the IMS AGW 1 has an AR processing capability.

For example, the query message 1 may include a parameter for querying whether the IMS AGW 1 has the AR processing capability. For example, the parameter may be "ifSupportforAR_flag".

The query message 1 may be an H248 add request. The parameter "ifSupportforAR_flag" is added to the H248 add request.

703 a: When it is determined that the IMS AGW 1 has the AR processing capability, the IMS AGW 1 sends a first indication to the IMS core network element, where the first indication indicates that the IMS AGW 1 has the AR processing capability. For example, the first indication may be "ifSupportforAR_flag=true". Optionally, the first indication may be carried in an H248 add response. The H248 add response includes "ifSupportforAR_flag=true".

704 a: The IMS core network element sends a second request message to the IMS AGW 1. Refer to 602. Details are not described herein again.

For 705 a and 706 a, refer to 603 and 604. Details are not described herein again.

For 701 b in FIG. 7B, refer to 601. Details are not described herein again.

702 b: When receiving the first request message of the first terminal device, the IMS core network element sends a query message 2 to the IMS AGW 1, where the query message 2 is for querying whether the IMS AGW 1 has the AR processing capability.

For example, the query message 2 may include a parameter for querying whether the IMS AGW 1 has the AR processing capability. For example, the parameter may be "ifSupportforAR_flag". The query message 2 may be an H248 add request. The parameter "ifSupportforAR_flag" is added to the H248 add request.

703 b: When determining that the IMS AGW 1 does not have the AR processing capability, the IMS AGW 1 sends a second indication to the IMS core network element, where the second indication indicates that the IMS AGW 1 does not have the AR processing capability. For example, the second indication may be "ifSupportforAR_flag=false". Optionally, the second indication may be carried in an H248 add response. The H248 add response includes "ifSupportforAR_flag=false". Alternatively, the H248 add response does not include the parameter "ifSupportforAR_flag", to indicate that the IMS AGW 1 does not have the AR processing capability.

704 b: The IMS core network element sends a second request message to the IMS AGW 2.

For 705 b and 706 b, refer to 603 and 604. Details are not described herein again.

In another possible implementation, the IMS core network element includes, in the second request message sent to the AR media processing network element, a parameter for querying whether the AR media processing network element has the AR processing capability. For example, the second request message carries "ifSupportforAR_flag". When the AR media processing network element includes "ifSupportforAR_flag=true" in the second response message, it is determined that the AR media processing network element has the AR processing capability. Alternatively, if the AR media processing network element does not include the parameter "ifSupportforAR_flag=true" in the second response message but includes a second description parameter, it is determined that the AR media processing network element has the AR processing capability. Alternatively, if the AR media processing network element includes the parameter "ifSupportforAR_flag=false" in the second response message, it is determined that the AR media processing network element does not have the AR processing capability.
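A minimal sketch of the decision rule above follows; the second response message is modeled as a plain dictionary, and the key names other than ifSupportforAR_flag are illustrative.

    def ar_capable(second_response: dict) -> bool:
        """Decide whether the AR media processing network element supports AR."""
        flag = second_response.get("ifSupportforAR_flag")
        if flag is True:
            return True          # explicit positive indication
        if flag is False:
            return False         # explicit negative indication
        # no flag: presence of a second description parameter still implies support
        return "second_description_parameter" in second_response

    print(ar_capable({"ifSupportforAR_flag": True}))             # True
    print(ar_capable({"second_description_parameter": "sdp"}))   # True
    print(ar_capable({"ifSupportforAR_flag": False}))            # False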

The following describes, with reference to a specific application scenario, a procedure of establishing an auxiliary data channel provided in an embodiment of this application.

In a first possible application scenario, the auxiliary data channel is established in a process of creating an AR session. In this scenario, for example, the AR media processing network element having an AR processing capability is the IMS AGW 1 to which the first terminal device belongs.

FIG. 8A-1 and FIG. 8A-2 are a schematic flowchart of a communication method in a first possible application scenario according to this application.

801: A first terminal device sends a first request message to an IMS core network element (for example, an intermediate IMS 1).

In this embodiment, for example, the IMS core network element configured to provide a service for the first terminal device is referred to as the intermediate IMS 1. In FIG. 8A-1 and FIG. 8A-2, for example, the first request message is a SIP INVITE message. For ease of differentiation, the first request message is referred to as a SIP INVITE 1 as an example. The SIP INVITE 1 is for requesting establishment of an AR session. The SIP INVITE 1 includes a first SDP of the first terminal device. The first SDP includes a description parameter used by the first terminal device to establish an auxiliary data channel. The first SDP further includes a description parameter used by the first terminal device to establish a media stream channel. The SIP INVITE 1 further includes address information of the first terminal device, including an IP address and a port number of the first terminal device. In FIG. 8A-1 and FIG. 8A-2, the IP address is IP10, and the port number is P10. For example, the first SDP includes parameters such as address information (for example, a port number), a type of a media stream, and a supported codec format. The media stream type may include a video (video stream), an audio (audio stream), and a data channel (AR auxiliary data). For example, in this case, the first SDP information includes an m line for describing AR auxiliary data, and may further include an m line for describing audio data and an m line for describing video data. The m line describes a media stream (a video stream, an audio stream, or a stream including AR auxiliary data).

As an example, the following three m lines are defined, and are respectively an m line for the video stream, an m line for the audio stream, and an m line for the stream including the AR auxiliary data.

m=video 49154 RTP/AVPF 101 102. The 1^(st) parameter in the m line represents a media name, and indicates that a supported media stream type is the video (video). The 2^(nd) parameter represents a port number, and indicates that the video stream is sent through a port 49154. The 3^(rd) parameter represents a transmission protocol, in which a real-time transport protocol (RTP)/audio video profile feedback (AVPF) is used. The 4^(th) and 5^(th) parameters represent supported payload type numbers.

m=audio 49152 RTP/AVP 98 97 99. The 1^(st) parameter in the m line represents a media name, and indicates that a supported media stream type is audio. The 2^(nd) parameter represents a port number, and indicates that the audio stream is sent through a port 49152. The 3^(rd) parameter represents a transmission protocol, in which a real-time transport protocol (RTP)/audio video profile (AVP) is used. The 4^(th) to 6^(th) parameters represent supported payload type numbers.

-   -   m=application 10001 UDP/DTLS/SCTP webrtc-datachannel.

The m line represents a description of a specific AR auxiliary data stream. 10001 may represent a port number of a quintuple, UDP/DTLS/SCTP represents a transmission protocol that the AR auxiliary data complies with, and webrtc-datachannel represents an established auxiliary data channel.
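As a small aid to reading the m lines above, the following Python sketch splits an m line into its media type, port, transport protocol, and format fields. It is not a full SDP parser.

    def parse_m_line(line: str):
        """Split an m line into media type, port, transport protocol, and formats."""
        fields = line.split()
        media = fields[0][2:]        # strip the leading "m="
        port = int(fields[1])
        transport = fields[2]
        formats = fields[3:]         # payload types or "webrtc-datachannel"
        return media, port, transport, formats

    for m_line in (
        "m=video 49154 RTP/AVPF 101 102",
        "m=audio 49152 RTP/AVP 98 97 99",
        "m=application 10001 UDP/DTLS/SCTP webrtc-datachannel",
    ):
        print(parse_m_line(m_line))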

The first SDP information may further include other information. For specific descriptions of the first SDP, refer to the descriptions in the embodiment corresponding to FIG. 6. Details are not described herein again.

802: The intermediate IMS 1 receives the SIP INVITE 1 from the first terminal device, and forwards the SIP INVITE 1 to an AS.

For example, after receiving the SIP INVITE 1, the intermediate IMS 1 determines, based on subscription data of the first terminal device, that the first terminal device has subscribed to an AR media augmentation service, and further forwards the SIP INVITE 1 to the AS. The AS provides a control service for the AR media augmentation service.

803: The AS sends a SIP INVITE 2 to the intermediate IMS 1.

After receiving the SIP INVITE 1, the AS determines that the AR service subscribed to by the first terminal device triggers a corresponding AR service. The AS triggers establishment of an AR session via the intermediate IMS 1, in other words, triggers the SIP INVITE 2 to the intermediate IMS 1. Optionally, the SIP INVITE 2 triggered by the AS and the SIP INVITE 1 sent by the first terminal device may be a same message, or may be different messages.

In some embodiments, the AS identifies that the first terminal device triggers the AR service (or initiates the AR session), and the AS may record the session identifier of the session as an AR session, and establish a binding relationship between the session identifier and an identifier of the first terminal device. After the binding relationship is established, the binding relationship may be used subsequently when a target object is obtained from a third-party server via the AS. Details are described below, and are not described herein again.

804: The intermediate IMS 1 sends, to the IMS AGW 1, a query message for querying whether the IMS AGW 1 has an AR capability. For ease of differentiation, the query message for querying whether the IMS AGW 1 has the AR capability is referred to as a query message 1. The query message 1 may be an H248 add request 1 (req1). The H248 add request 1 may be for requesting establishment of a bearer. For example, the H248 add request 1 carries a parameter for querying whether the IMS AGW 1 has an AR processing capability. For example, the parameter for querying whether the IMS AGW 1 has the AR processing capability may be "IfSupportforAR_flag".

Optionally, after receiving the H248 add request 1, the IMS AGW 1 creates an outgoing termination T2, that is, an outgoing termination of the IMS AGW 1. The IMS AGW 1 feeds back, to the intermediate IMS 1, address information corresponding to the termination T2 and indication information indicating whether the IMS AGW 1 has the AR processing capability.

805: After receiving the H248 add request 1, the IMS AGW 1 sends a query response 1 to the intermediate IMS 1, where the query response 1 includes the indication information indicating whether the IMS AGW 1 has the AR processing capability. For example, the query response 1 may be an H248 add response 1. The IMS AGW 1 sends the H248 add response (resp) 1, where the H248 add response 1 includes the indication information and the address information corresponding to the termination T2. For example, the address information corresponding to the termination T2 includes an IP address and a port number. For example, the IP address is IP20, and the port number is P20. For example, the IMS AGW 1 has the AR processing capability, and the H248 add response includes "IfSupportforAR_flag=True"; or the IMS AGW 1 does not have the AR processing capability, and the H248 add response carries "IfSupportforAR_flag=False" or does not carry the parameter. In this embodiment, for example, the IMS AGW 1 has the AR processing capability.

806: After receiving the H248 add response 1, the intermediate IMS 1 identifies that the IMS AGW 1 has the AR processing capability, and triggers creation of an AR session toward the intermediate IMS 2. The intermediate IMS 1 sends a SIP INVITE 3, where the SIP INVITE 3 carries the address information corresponding to the termination T2 of the IMS AGW 1.

It is to be noted that, if the intermediate IMS 1 identifies that the IMS AGW 1 has the AR processing capability, only an auxiliary data channel needs to be established between the first terminal device and the IMS AGW 1. In this case, an AR session establishment request sent to a network side of the second terminal device does not need to carry an SDP for requesting establishment of the auxiliary data channel.

807: The intermediate IMS 2 sends, to an AS 2, the SIP INVITE 3 that carries the address information corresponding to the termination T2 of the IMS AGW 1.

808: The AS 2 sends, to the intermediate IMS 2, the SIP INVITE 3 that carries the address information corresponding to the termination T2 of the IMS AGW 1.

809: The intermediate IMS 2 sends, to an IMS AGW 2, an H248 add request 2 indicating the IMS AGW 2 to create an outgoing termination T2. For example, an IP address and a port number are included. For example, the IP address is IP30, and the port number is P30.

810: The IMS AGW 2 creates an outgoing termination T2, and sends, to the intermediate IMS 2, an H248 add resp 2 that carries address information corresponding to the termination T2 of the IMS AGW 2.

811: The IMS AGW 2 sends a SIP INVITE 4 to the second terminal device, where the SIP INVITE 4 includes the address information corresponding to the termination T2 of the IMS AGW 2.

812: The second terminal device sends a SIP 180 to the IMS AGW 2, where the SIP 180 carries address information (IP1a, P1a) of the second terminal device.

813: The intermediate IMS 2 sends, to the IMS AGW 2, an H248 mod request 1 indicating the IMS AGW 2 to configure an outgoing termination T1. For example, an association relationship between address information of T1 of the IMS AGW 2 and the address information of the second terminal device is established.

814: The IMS AGW 2 configures the outgoing termination T1, and sends an H248 mod resp 1 to the intermediate IMS 2. The H248 mod resp 1 may carry the address information corresponding to the termination T2 of the IMS AGW 2 and the address information of the second terminal device.

815: The intermediate IMS 2 sends, to the IMS AGW 2, an H248 add request 3 indicating the IMS AGW 2 to create an incoming termination T2. For example, the H248 add request 3 includes the address information corresponding to the termination T2 of the IMS AGW 1.

816: The IMS AGW 2 creates an incoming termination T2, and sends, to the intermediate IMS 2, an H248 add resp 3 that carries the address information (IP40, P40) corresponding to the termination T2 of the IMS AGW 2.

817: The intermediate IMS 1 receives the SIP 180 from the second terminal device, where, for example, the SIP 180 may include the address information of the second terminal device. For example, an IP address of the second terminal device is IP1a, and a port number of the second terminal device is P1a. The intermediate IMS 1 records the IP address and the port number of the second terminal device.

818: The intermediate IMS 1 sends a configuration modification request to the IMS AGW 1. For example, the configuration modification request may be an H248 mod req 3. The H248 mod req 3 includes the IP address (IP1a) and the port number (P1a) of the second terminal device. The H248 mod req 3 is for requesting the IMS AGW 1 to configure an outgoing termination T2.

819: The IMS AGW 1 configures the outgoing termination T2 based on the address information of the second terminal device, and sends a configuration modification response to the intermediate IMS 1, where the configuration modification response may be an H248 mod response 3.

820: The intermediate IMS 1 receives the configuration modification response from the IMS AGW 1, and sends a second request message to the IMS AGW 1. The second request message is for requesting establishment of an auxiliary data channel, or may be for requesting to create an incoming termination T1, for example, by using an H248 add req 4. The H248 add req 4 is further for requesting establishment of the auxiliary data channel. The H248 add req 4 includes the first SDP of the first terminal device and the address information of the first terminal device.

After receiving the H248 add req 4, the IMS AGW 1 creates an incoming termination T1, and agrees to the establishment of the auxiliary data channel.

821: The IMS AGW 1 sends, to the intermediate IMS 1, a second response message that includes a response to information about the incoming termination T1, and may further include a response to the establishment of the auxiliary data channel. The second response message may be an H248 add response 4. The H248 add response 4 includes second SDP information of the IMS AGW 1 and address information of the IMS AGW 1. For example, the address information of the IMS AGW 1 includes an IP address (IP2a) and a port number (P2a) of the IMS AGW 1. Optionally, the H248 add response 4 may further include the address information of the first terminal device.

For example, the second SDP information includes the m line for describing the auxiliary data channel that needs to be established by the IMS AGW 1. In an example, a response of the IMS AGW 1 to the m line of the first terminal device may be described as follows.

-   -   m=application 10001 UDP/DTLS/SCTP webrtc-datachannel.

822: The intermediate IMS 1 records the address information of the termination T1 of the IMS AGW 1, and determines the second SDP information for establishing the auxiliary data channel, where a first response message sent to the AS includes the second SDP information and the address information of the IMS AGW 1. For example, the first response message may be a SIP 180. The SIP 180 includes the second SDP information and the address information of the IMS AGW 1.

823: The AS forwards the SIP 180 to the intermediate IMS 1.

824: The intermediate IMS 1 forwards the SIP 180 to the first terminal device.

In some embodiments, in step 803, after receiving the SIP INVITE 1, when determining that the first terminal device has subscribed to the AR service, the AS may send address information of the AS to the IMS AGW 1 via the intermediate IMS 1. The AS may add the address information of the AS to the SIP INVITE 1 and send the SIP INVITE 1 to the intermediate IMS 1. In this way, the second request message (H248 add request) sent by the intermediate IMS 1 to the IMS AGW 1 carries the address information of the AS, so that when the IMS AGW 1 subsequently needs to obtain content provided by the third-party server, the IMS AGW 1 may obtain the needed content from the third-party server via the AS.

For example, the AS may send a subscription query request to an HSS, to query whether the subscription information of the first terminal device includes information about the subscribed AR service. The HSS may feed back the subscription information of the first terminal device to the AS by using a subscription query response. The subscription query request may be a diameter request message. The subscription query response may be a diameter response message.

In some other embodiments, after receiving the SIP INVITE 1, the intermediate IMS 1 forwards the SIP INVITE 1 to the AS. When determining that the first terminal device has subscribed to the AR service, the AS may send stored address information of the third-party server to the IMS AGW 1 via the intermediate IMS 1. The AS may add the address information of the third-party server to the first request message and send the first request message to the intermediate IMS 1. In this way, the second request message (H248 add request) sent by the intermediate IMS 1 to the IMS AGW 1 carries the address information of the third-party server, so that when the IMS AGW 1 subsequently needs to obtain content provided by the third-party server, the IMS AGW 1 may obtain the needed content from the third-party server.

With reference to FIG. 8B, the following describes the foregoing termination T1 and termination T2. The termination T1 is a termination that is on the IMS AGW and that is for connecting to a terminal device through an access network. The termination T2 is a termination that is on the IMS AGW and that is for connecting to an IMS core network. The IMS AGW can establish a connection with a communication peer terminal device through the IMS core network.
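The following minimal Python sketch models the two terminations as a simple context on the IMS AGW, with T1 facing the terminal device over the access network and T2 facing the IMS core network. The class and field names, and the example addresses, are illustrative only and are not part of the H248 model.

    from dataclasses import dataclass

    @dataclass
    class Termination:
        name: str    # "T1" or "T2"
        side: str    # "access" (toward the terminal device) or "core" (toward the IMS core)
        ip: str
        port: int

    @dataclass
    class AgwContext:
        t1: Termination   # termination connecting to the terminal device through the access network
        t2: Termination   # termination connecting to the IMS core network

    ctx = AgwContext(
        t1=Termination("T1", "access", "192.0.2.10", 2000),
        t2=Termination("T2", "core", "198.51.100.20", 3000),
    )
    print(ctx.t1.side, ctx.t2.side)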

In a second possible application scenario, establishment of the auxiliary data channel is triggered by triggering a process of re-establishing an AR session. In this scenario, an example in which the IMS AGW 1 to which the first terminal device belongs has the AR processing capability is used.

FIG. 9 is a schematic flowchart of a communication method in a second possible application scenario according to this application.

901: After an AR session is established between a first terminal device and a second terminal device, the first terminal device sends a first request message to an IMS core network element (for example, an intermediate IMS 1). In FIG. 9, an example in which the first request message is a SIP re-establishment request (Re-INVITE 1) is used. The SIP Re-INVITE 1 is for requesting to create an auxiliary data channel. The SIP Re-INVITE 1 includes a first SDP of the first terminal device. For descriptions of the first SDP of the first terminal device, refer to the descriptions in the embodiment corresponding to FIG. 6. Details are not described herein again.

902: The intermediate IMS 1 receives the SIP Re-INVITE 1 from the first terminal device, and forwards the SIP Re-INVITE 1 to an AS.

903: The AS sends the SIP Re-INVITE 1 to the intermediate IMS 1.

904: The intermediate IMS 1 sends a query message 1 to an IMS AGW 1. The query message 1 may be carried in an H248 mod req. The H248 mod req is for requesting to establish a bearer. For example, the H248 mod req may further carry a parameter for querying whether the IMS AGW 1 has an AR processing capability. For example, the parameter for querying whether the IMS AGW 1 has the AR processing capability may be "IfSupportforAR_flag".

Optionally, after receiving the H248 mod req, the IMS AGW 1 indicates to the intermediate IMS 1 whether the IMS AGW 1 has the AR processing capability.

905: After receiving the H248 mod request, the IMS AGW 1 sends a query response 1 to the intermediate IMS 1, where the query response 1 includes indication information indicating whether the IMS AGW 1 has the AR processing capability. For example, the IMS AGW 1 sends an H248 mod response, where the H248 mod response includes the indication information. For example, the IMS AGW 1 has the AR processing capability, and the H248 mod response includes "IfSupportforAR_flag=True"; or the IMS AGW 1 does not have the AR processing capability, and the H248 mod response carries "IfSupportforAR_flag=False" or does not carry the parameter. In this embodiment, for example, the IMS AGW 1 has the AR processing capability.

906: After receiving the H248 mod response, the intermediate IMS 1 identifies that the IMS AGW 1 has the AR processing capability, in other words, only an auxiliary data channel needs to be established between the first terminal device and the IMS AGW 1. In this case, an AR session establishment request sent to the second terminal device does not need to carry a request for establishing the auxiliary data channel. The AR session establishment request may be a SIP Re-INVITE 2.

For example, the intermediate IMS 1 sends the SIP Re-INVITE 2 to the second terminal device via the intermediate IMS 2 to which the second terminal device belongs.

907: The intermediate IMS 1 receives, from the second terminal device, a SIP response, for example, a SIP 200 OK.

908: The intermediate IMS 1 sends a second request message to the IMS AGW 1, where the second request message may be for requesting establishment of an auxiliary data channel, and is further for modifying incoming termination T1 information. For example, the intermediate IMS 1 sends, by using an H248 add req, a request for modifying the incoming termination T1 information to the IMS AGW 1. The H248 add req is further for requesting establishment of the auxiliary data channel. The H248 add req includes the first SDP of the first terminal device.

After receiving the H248 add req, the IMS AGW 1 modifies the incoming termination T1 information, and agrees to the establishment of the auxiliary data channel.

909: The IMS AGW 1 sends a response for modifying the incoming termination T1 information to the intermediate IMS 1. The response for modifying the incoming termination T1 information may be an H248 add response. The H248 add response includes second SDP information of the IMS AGW 1. For descriptions of the second SDP, refer to the descriptions in the embodiment corresponding to FIG. 6. Details are not described herein again.

910: The intermediate IMS 1 extracts the second SDP information of the IMS AGW 1 and sends, to the AS, an AR session establishment response that includes the second SDP information. For example, the AR session establishment response may be a SIP 200 OK. The SIP 200 OK includes the second SDP information.

911: The AS forwards the SIP 200 OK to the intermediate IMS 1.

912: The intermediate IMS 1 forwards the SIP 200 OK to the first terminal device.

In some embodiments, in step 903, after receiving the SIP Re-INVITE 1, when determining that the first terminal device has subscribed to an AR service, the AS may send address information of the AS to the IMS AGW 1 via the intermediate IMS 1. The AS may add the address information of the AS to the SIP Re-INVITE 1 and send the SIP Re-INVITE 1 to the intermediate IMS 1. In this way, the second request message (H248 add request) sent by the intermediate IMS 1 to the IMS AGW 1 carries the address information of the AS, so that when the IMS AGW 1 subsequently needs to obtain content provided by the third-party server, the IMS AGW 1 may obtain the needed content from the third-party server via the AS.

In some other embodiments, after receiving the SIP Re-INVITE 1, the intermediate IMS 1 forwards the SIP Re-INVITE 1 to the AS. When determining that the first terminal device has subscribed to the AR service, the AS may send stored address information of the third-party server to the IMS AGW 1 via the intermediate IMS 1. The AS may add the address information of the third-party server to the first request message and send the first request message to the intermediate IMS 1. In this way, the second request message (H248 add request) sent by the intermediate IMS 1 to the IMS AGW 1 carries the address information of the third-party server, so that when the IMS AGW 1 subsequently needs to obtain content provided by the third-party server, the IMS AGW 1 may obtain the needed content from the third-party server.

In a third possible application scenario, the auxiliary data channel is established in a process of creating an AR session. In this scenario, for example, the IMS AGW 1 to which the first terminal device belongs does not have the AR processing capability, and the IMS AGW 2 to which the second terminal device belongs has the AR processing capability.

FIG. 10A and FIG. 10B are a schematic flowchart of a communication method in a third possible application scenario according to this application.

1001 to 1004: Refer to 801 to 804. Details are not described herein again.

1005: After receiving an H248 add request, an IMS AGW 1 sends, to an intermediate IMS 1, indication information indicating whether the IMS AGW 1 has an AR processing capability. For example, the IMS AGW 1 sends an H248 add response, where the H248 add response carries the indication information and address information corresponding to a termination T2. For example, the address information corresponding to the termination T2 includes an IP address and a port number. For example, the IP address is IP20, and the port number is P20. For example, the IMS AGW 1 has the AR processing capability, and the H248 add response includes "IfSupportforAR_flag=True"; or the IMS AGW 1 does not have the AR processing capability, and the H248 add response carries "IfSupportforAR_flag=False" or does not carry this parameter. In this embodiment, for example, the IMS AGW 1 does not have the AR processing capability.

After receiving the H248 add response, the intermediate IMS 1 identifies that the IMS AGW 1 does not have the AR processing capability, and sends a second request message (SIP INVITE 2) to an IMS AGW 2 via the intermediate IMS 2 to which a second terminal device belongs and an AS. The SIP INVITE 2 sent to the second terminal device does not need to carry a request for establishing an auxiliary data channel. The SIP INVITE 2 sent to the second terminal device carries the address information corresponding to the termination T2 of the IMS AGW 1.

1006 to 1008: Refer to 806 to 808. Details are not described herein again.

1009: After receiving the SIP INVITE 3 from the AS, the intermediate IMS 2 may send a query message 2 to the IMS AGW 2.

The query message 2 may be carried in an H248 add req 2. For example, the H248 add req 2 may further carry a parameter for querying whether the IMS AGW 2 has the AR processing capability. For example, the parameter for querying whether the IMS AGW 2 has the AR processing capability may be "IfSupportforAR_flag". The H248 add req 2 further indicates to create an outgoing termination T2. For details, refer to 809. Details are not described herein again.

Optionally, after receiving the H248 add req 2, the IMS AGW 2 indicates to the intermediate IMS 2 whether the IMS AGW 2 has the AR processing capability.

1010: After receiving the H248 add request 2, the IMS AGW 2 sends a query response 2 to the intermediate IMS 2, where the query response 2 includes indication information 2 indicating whether the IMS AGW 2 has the AR processing capability. For example, the IMS AGW 2 sends an H248 add response 2 (namely, the query response 2), where the H248 add response 2 includes the indication information 2. For example, the IMS AGW 2 has the AR processing capability, and the H248 add response 2 includes "IfSupportforAR_flag=True"; or the IMS AGW 2 does not have the AR processing capability, and the H248 add response carries "IfSupportforAR_flag=False" or does not carry this parameter. In this embodiment, for example, the IMS AGW 2 has the AR processing capability. The H248 add response 2 further includes address information of T2.

1011 to 1014: Refer to 811 to 814. Details are not described herein again.

1015: The intermediate IMS 2 sends, to the IMS AGW 2, an H248 add request 3 indicating the IMS AGW 2 to create an incoming termination T1. The H248 add request 3 is further for requesting establishment of an auxiliary data channel with the first terminal device. The H248 add request 3 includes the first SDP of the first terminal device. For example, the H248 add request 3 further includes the address information corresponding to the termination T2 of the IMS AGW 1.

1016: The IMS AGW 2 creates an incoming termination T2, and sends an H248 add resp 3 to the intermediate IMS 2, where the H248 add resp 3 includes an acknowledgment for creating the auxiliary data channel, for example, a second SDP of the IMS AGW 2, and further includes address information (IP40, P40) corresponding to a termination T2 of the IMS AGW 2.

1017: The intermediate IMS 2 sends a SIP 180 to the intermediate IMS 1.

1018: The intermediate IMS 1 sends a configuration modification request to the IMS AGW 1. For example, the configuration modification request may be an H248 mod req 3. The H248 mod req 3 is for requesting the IMS AGW 1 to configure an outgoing termination T2.

1019: The IMS AGW 1 configures an outgoing termination T2, and sends a configuration modification response message to the intermediate IMS 1, where the configuration modification response message may be an H248 mod response 3.

1020: The intermediate IMS 1 receives the configuration modification response message from the IMS AGW 1, and then initiates, to the IMS AGW 1, creation of an incoming termination T1. For example, a request message for creating the incoming termination T1 carries a request for establishing an auxiliary data channel and address information of the first terminal device. The request message for creating the incoming termination T1 may be an H248 add req 4.

1021: The IMS AGW 1 sends an H248 add response 4 to the intermediate IMS 1. The H248 add response 4 includes address information of the IMS AGW 1. For example, the address information of the IMS AGW 1 includes an IP address (IP2a) and a port number (P2a) of the IMS AGW 1. Optionally, the H248 add response 4 may further include the address information of the first terminal device.

1022: The intermediate IMS 1 records address information of the termination T1 of the IMS AGW 1, and sends, to the AS, a SIP 180 that includes the second SDP information of the IMS AGW 2 and the address information of the IMS AGW 1.

1023: The AS forwards the SIP 180 to the intermediate IMS 1.

1024: The intermediate IMS 1 forwards the SIP 180 to the first terminal device.

In some embodiments, after receiving a SIP INVITE 3, the AS 2 may send address information of the AS 2 to the IMS AGW 2 via the intermediate IMS 2. The AS 2 may add the address information of the AS 2 to the SIP INVITE 3 and send the SIP INVITE 3 to the intermediate IMS 2. In this way, the H248 add req 3 sent by the intermediate IMS 2 to the IMS AGW 2 carries the address information of the AS 2, so that when the IMS AGW 2 subsequently needs to obtain content provided by a third-party server, the IMS AGW 2 may obtain the needed content from the third-party server via the AS.

In some other embodiments, after receiving the SIP INVITE 3, the intermediate IMS 2 forwards the SIP INVITE 3 to the AS 2. The AS 2 may send stored address information of the third-party server to the IMS AGW 2 via the intermediate IMS 2. The intermediate IMS 2 may add the address information of the third-party server to an H248 add req 3 and send the H248 add req 3 to the IMS AGW 2, so that when the IMS AGW 2 subsequently needs to obtain content provided by the third-party server, the IMS AGW 2 may obtain the needed content from the third-party server.

In a fourth possible application scenario, the auxiliary data channel is established in a process of creating an AR conference. In this scenario, an example in which the MRF to which the first terminal device belongs has the AR processing capability is used.

FIG. 11 is a schematic flowchart of a communication method in a fourth possible application scenario according to this application.

1101 to 1103: Refer to 801 to 803. Details are not described herein again.

1104: An intermediate IMS 1 identifies that a first terminal device initiates an AR conference, and sends a second request message to an MRF, where the second request message includes a first SDP of the first terminal device. The second request message may be a SIP INVITE 2.

1105: When it is determined that the MRF has an AR processing capability, the MRF sends a second response message to the intermediate IMS 1, where the second response message includes a second SDP of the MRF. When it is determined that the MRF does not have an AR processing capability, the MRF ignores a request of the first terminal device for establishing an auxiliary data channel. For example, the second response message may be a SIP 180.

1106: The intermediate IMS 1 sends a first response message to an AS, where the first response message may be the SIP 180. The first response message includes the second SDP of the MRF.

1107: The AS sends a first response message to the intermediate IMS 1.

1108: The intermediate IMS 1 sends the first response message to the first terminal device.

In some embodiments, in step 1103, after receiving a SIP INVITE 1, when determining that the first terminal device has subscribed to an AR service, the AS may send address information of the AS to the MRF via the intermediate IMS 1. The AS may add the address information of the AS to the SIP INVITE 1 and send the SIP INVITE 1 to the intermediate IMS 1. In this way, the second request message (SIP INVITE 2) sent by the intermediate IMS 1 to the MRF carries the address information of the AS, so that when the MRF subsequently needs to obtain content provided by a third-party server, the MRF may obtain the needed content from the third-party server via the AS.

In some other embodiments, after receiving the SIP INVITE 1, the intermediate IMS 1 forwards the SIP INVITE 1 to the AS. When determining that the first terminal device has subscribed to the AR service, the AS may send stored address information of the third-party server to the MRF via the intermediate IMS 1. The AS may add the address information of the third-party server to the first request message and send the first request message to the intermediate IMS 1. In this way, the second request message (SIP INVITE 2) sent by the intermediate IMS 1 to the MRF carries the address information of the third-party server, so that when the MRF subsequently needs to obtain content provided by the third-party server, the MRF may obtain the needed content from the third-party server.

With reference to an embodiment, the following describes a procedure of augmenting a media stream after establishment of an auxiliary data channel is completed in embodiments of this application.

FIG. 12A describes a procedure of augmenting a media stream in a process of performing AR communication between a first terminal device and a second terminal device. An auxiliary data channel is established between the first terminal device and an AR media processing network element. A media stream channel is further established between the first terminal device and the second terminal device. The media stream channel passes through the AR media processing network element.

1201: The first terminal device sends AR auxiliary data to the AR media processing network element through the auxiliary data channel in the process of performing AR communication with the second terminal device. The AR media processing network element is located in an IMS network of the first terminal device or is located in an IMS network of the second terminal device. The AR media processing network element may be an IMS AGW or an MRF.

1202: The first terminal device sends the media stream to the AR media processing network element through the media stream channel. The media stream is generated in the process of performing AR communication between the first terminal device and the second terminal device.

1203: The AR media processing network element performs media augmented processing on the media stream based on the AR auxiliary data to obtain an AR media stream.

1204: The AR media processing network element sends the AR media stream to the first terminal device. The first terminal device receives the AR media stream.

Optionally, in 1205, the AR media processing network element sends the AR media stream to the second terminal device through the media stream channel. The second terminal device receives the AR media stream.

For example, the AR media processing network element is the IMS AGW. An IMS AGW serving the first terminal device is referred to as an IMS AGW 1, and an IMS AGW that provides a service for the second terminal device is referred to as an IMS AGW 2. As shown in FIG. 12B, both the IMS AGW 1 and the IMS AGW 2 have an AR processing capability. In this application, an AR auxiliary data proximity processing principle is used. If AR auxiliary data (non-media data) received by the IMS AGW 1 is data related to the first terminal device, for example, pose data of a user of the first terminal device, the AR auxiliary data is used only for performing pose rendering on a media stream shared by the first terminal device. Therefore, the IMS AGW 1 obtains 1{a+b} (AR media stream) through combination and rendering based on 1 (an original media stream), a (non-media data such as the pose data of the user of the first terminal device) and b (non-media data such as user annotation data acquired by the first terminal device), and returns the AR media stream to the first terminal device. The IMS AGW 1 then obtains 1{b} (AR media stream) through rendering based on 1 (original media stream) and b (non-media data such as user annotation data acquired by UE 1), and sends the AR media stream to the IMS AGW 2. The IMS AGW 2 finally obtains a media stream 1{b+c} (AR media stream) through rendering based on the received media stream and c (non-media data) of UE 2, and sends the AR media stream to the UE 2. As shown in FIG. 12C, if the IMS AGW 2 does not have the AR media processing capability, the IMS AGW 2 transfers c (the non-media data) of the UE 2 to the IMS AGW 1 for media augmented processing.
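The following minimal Python sketch restates the proximity processing principle above: each IMS AGW renders only the auxiliary data it received locally, so the first terminal device receives 1{a+b}, IMS AGW 2 receives 1{b}, and the second terminal device receives 1{b+c}. The render function is merely a stand-in for the actual media augmented processing.

    def render(base: str, aux: list) -> str:
        """Stand-in for media augmented processing: attach auxiliary data to a stream."""
        return f"{base}{{{'+'.join(aux)}}}" if aux else base

    # IMS AGW 1 combines the original stream "1" with its locally received data:
    # a (pose data of UE 1's user) and b (annotation data acquired by UE 1)
    to_ue1 = render("1", ["a", "b"])   # "1{a+b}" returned to the first terminal device
    to_agw2 = render("1", ["b"])       # "1{b}" forwarded to IMS AGW 2

    # IMS AGW 2 adds its own locally received data c before delivery to UE 2
    to_ue2 = render("1", ["b", "c"])   # "1{b+c}" sent to the second terminal device
    print(to_ue1, to_agw2, to_ue2)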

With reference to a specific application scenario, the following describes a solution for augmenting a media stream through the auxiliary data channel.

For example, the user of the first terminal device shares a video with a user of the second terminal device. The user of the first terminal device and the user of the second terminal device can simultaneously watch the video shared by the first terminal device. In the following descriptions, for example, the AR media processing network element is the IMS AGW.

Refer to FIG. 13. For example, both an IMS AGW 1 to which a first terminal device belongs and an IMS AGW 2 to which a second terminal device belongs have an AR processing capability. Establishment of an auxiliary data channel 1 has been completed between the first terminal device and the IMS AGW 1 to which the first terminal device belongs. Establishment of an auxiliary data channel 2 has been completed between the second terminal device and the IMS AGW 2 to which the second terminal device belongs. In addition, a media stream channel is established between the first terminal device and the second terminal device. It may be considered that the media stream channel is divided into three segments, which respectively include a media stream channel between the first terminal device and the IMS AGW 1, a media stream channel between the IMS AGW 1 and the IMS AGW 2, and a media stream channel between the IMS AGW 2 and the second terminal device. The first terminal device has the AR capability. For example, the first terminal device supports a user in annotating a shared picture. For ease of differentiation, a user of the first terminal device is referred to as a user 1, and a user of the second terminal device is referred to as a user 2.

1301: The first terminal device sends a media stream to the IMS AGW 1 through the media stream channel.

1302: The first terminal device obtains pose data 1 generated through annotation performed by the user 1 on a display interface corresponding to the media stream, and sends the pose data 1 as AR auxiliary data to the IMS AGW 1 through the auxiliary data channel 1.

1303: The IMS AGW 1 renders the media stream based on the pose data 1 to obtain an AR media stream 1.

1304 a: The IMS AGW 1 sends the AR media stream 1 to the IMS AGW 2.

1304 b: The IMS AGW 1 sends the AR media stream 1 to the first terminal device.

1305: The IMS AGW 2 sends the AR media stream 1 to the second terminal device through the media stream channel.

In this way, both the user 1 of the first terminal device and the user 2 of the second terminal device can watch the display interface on which the user 1 performs annotation.

In some embodiments, the second terminal device also has the AR capability. For example, the second terminal device supports a user in annotating a shared picture.

1306: The second terminal device obtains pose data 2 generated through annotation performed by the user 2 on the display interface corresponding to the media stream, and sends the pose data 2 as AR auxiliary data to the IMS AGW 2 through the auxiliary data channel 2.

1307: The IMS AGW 2 renders the media stream based on the pose data 2 to obtain an AR media stream 2.

1308 a: The IMS AGW 2 sends the AR media stream 2 to the IMS AGW 1.

1308 b: The IMS AGW 2 sends the AR media stream 2 to the second terminal device.

1309: The IMS AGW 1 sends the AR media stream 2 to the first terminal device through the media stream channel 1. In this way, both the user 1 of the first terminal device and the user 2 of the second terminal device can watch the display interface on which the user 2 performs annotation.

Refer to FIG. 14. For example, an IMS AGW 1 to which a first terminal device belongs does not have an AR processing capability, and an IMS AGW 2 to which a second terminal device belongs has the AR processing capability. Establishment of an auxiliary data channel 1 has been completed between the first terminal device and the IMS AGW 2 to which the second terminal device belongs. Establishment of an auxiliary data channel 2 has been completed between the second terminal device and the IMS AGW 2 to which the second terminal device belongs. In addition, a media stream channel is established between the first terminal device and the second terminal device. It may be considered that the media stream channel is divided into three segments, which respectively include a media stream channel between the first terminal device and the IMS AGW 1, a media stream channel between the IMS AGW 1 and the IMS AGW 2, and a media stream channel between the IMS AGW 2 and the second terminal device. The first terminal device has the AR capability. For example, the first terminal device supports a user in annotating a shared picture. For ease of differentiation, a user of the first terminal device is referred to as a user 1, and a user of the second terminal device is referred to as a user 2.

1401: The first terminal device sends a media stream to the IMS AGW 1 through the media stream channel.

1402: The first terminal device obtains pose data 1 generated through annotation performed by the user 1 on a display interface corresponding to the media stream, and sends the pose data 1 as AR auxiliary data to the IMS AGW 2 through the auxiliary data channel 1.

1403: The IMS AGW 2 renders the media stream based on the pose data 1 to obtain an AR media stream 1.

1404a: The IMS AGW 2 sends the AR media stream 1 to the IMS AGW 1 through the media stream channel.

1405: The IMS AGW 1 sends the AR media stream 1 to the first terminal device through the media stream channel.

1404b: The IMS AGW 2 sends the AR media stream 1 to the second terminal device.

In this way, both the user 1 of the first terminal device and the user 2 of the second terminal device can watch the display interface on which the user 1 performs annotation.

In some embodiments, the second terminal device also has the AR capability. For example, the second terminal device supports a user in annotating a shared picture.

1406: The second terminal device obtains pose data 2 generated through annotation performed by the user 2 on the display interface corresponding to the media stream, and sends the pose data 2 as AR auxiliary data to the IMS AGW 2 through the auxiliary data channel 2.

1407: The IMS AGW 2 renders the media stream based on the pose data 2 to obtain an AR media stream 2.

1408a: The IMS AGW 2 sends the AR media stream 2 to the IMS AGW 1.

1408b: The IMS AGW 2 sends the AR media stream 2 to the second terminal device.

1409: The IMS AGW 1 sends the AR media stream 2 to the first terminal device through the media stream channel 1. In this way, both the user 1 of the first terminal device and the user 2 of the second terminal device can watch the display interface on which the user 2 performs annotation.

With reference to an embodiment, the following describes a procedure of obtaining a target object in this embodiment of this application.

FIG. 15 is a schematic flowchart of a possible procedure for obtaining a target object according to an embodiment of this application.

1501: In a process of AR communication between a first terminal device and a second terminal device, the first terminal device sends first AR auxiliary data to an AR media processing network element through an auxiliary data channel. The first AR auxiliary data includes information about a target object that needs to be used in the AR communication. The AR media processing network element mentioned herein is an IMS access gateway having an AR processing capability or an MRF. It may be understood that an auxiliary data channel is established between the AR media processing network element and the first terminal device. The AR media processing network element may be located in an IMS network to which the first terminal device belongs, namely, an IMS AGW or an MRF corresponding to the first terminal device. Alternatively, the AR media processing network element may be located in an IMS network to which the second terminal device belongs, namely, an IMS AGW or an MRF corresponding to the second terminal device.

1502: When receiving the first AR auxiliary data from the first terminal device through the auxiliary data channel, the AR media processing network element obtains the target object from a third-party server based on the first AR auxiliary data. It may be understood that the target object is used to augment a media stream for the AR communication between the first terminal device and the second terminal device.

For example, in some application scenarios, a virtual object, for example, a 2D/3D object, needs to be used, but the terminal device usually does not have a capability of storing a large quantity of 2D/3D objects. Therefore, in the foregoing manner, a requirement of a terminal device user may be received through the auxiliary data channel, and the 2D/3D object may be obtained from the third-party server.

For example, in some application scenarios, when a real background picture needed by a foreground object is relatively fixed, the AR media processing network element may render the foreground picture and the background picture together, to reduce uplink bandwidth pressure on the terminal device side. In this case, the terminal device may send requirement information of the real background picture to the AR media processing network element through the auxiliary data channel, so that the AR media processing network element obtains the real background picture from the third-party server based on the requirement information, renders the real background picture together with a foreground media stream sent by the terminal device, and sends the media stream obtained through rendering to the terminal device.

In a possible implementation, the AR media processing network element obtains address information of an AS in advance, and when obtaining a target object, the AR media processing network element obtains the target object from the third-party server via the AS. For a specific obtaining manner, refer to the foregoing descriptions. Details are not described herein again.

In another possible implementation, the AR media processing network element obtains address information of the third-party server in advance, so that when obtaining a target object, the AR media processing network element may directly obtain the target object from the third-party server based on the address information of the third-party server.
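
The two implementations above differ only in where the AR media processing network element sends its request: to the AS, or straight to the third-party server. A minimal sketch, assuming the address information is pre-provisioned in a simple configuration dictionary and that the target object is exposed over plain HTTP (the URL path, query parameter, and configuration keys are illustrative):

```python
from urllib import parse, request

def fetch_target_object(config, object_info):
    """Fetch a target object either via the AS or directly from the third-party server.

    config holds pre-provisioned address information: "as_address" routes the request
    via the application server, "third_party_address" fetches directly. Both keys,
    the URL path, and the query parameter are assumptions for illustration.
    """
    host = config.get("as_address") or config["third_party_address"]
    url = f"http://{host}/target-objects?info={parse.quote(object_info)}"
    with request.urlopen(url, timeout=5) as resp:   # plain HTTP GET, for illustration only
        return resp.read()
```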

With reference to specific embodiments, the following describes a procedure of obtaining a virtual object. For details, refer to FIG. 16 and FIG. 17.

Refer to FIG. 16. For example, an AR media processing network element obtains a virtual object from a third-party server via an AS. The AR media processing network element is an IMS AGW/MRF. The IMS AGW may be an IMS AGW to which a first terminal device or a second terminal device at a communication peer belongs. The MRF may be an MRF to which a first terminal device or a second terminal device at a communication peer belongs.

1601: The first terminal device sends, to the IMS AGW/MRF through an auxiliary data channel, information about a virtual object, for example, an identifier of the virtual object or a type of the virtual object.

1602: When receiving the information about the virtual object, the IMS AGW/MRF sends a request message 1 to an application server, where the request message 1 includes the information about the virtual object. The type of the virtual object is used as an example. For example, the request message 1 may be of an HTTP message type. Certainly, another message type may alternatively be used. This is not limited in this application. For example, the request message 1 may be an HTTP request (req) message 1. For example, the request message 1 includes an identifier of a terminal device.

1603: The application server receives the request message 1, and when determining that the first terminal device needs to request the virtual object, forwards the request message 1 to the third-party server. For example, the application server may extract the identifier of the terminal device from the request message 1, and determine, based on the identifier of the terminal device, an identifier of a session established by the terminal device. The application server may establish a communication connection, for example, an HTTP connection, to the third-party server based on the identifier of the session, so that the application server forwards the request message 1 to the third-party server, to obtain the virtual object from the third-party server. A session identifier of the first terminal device may be added to the request message 1 forwarded to the third-party server.

1604: The third-party server receives the request message 1, and obtains, based on the type of the virtual object, a plurality of virtual object identifiers that meet the type of the virtual object. An example in which a virtual object list includes the plurality of virtual object identifiers is used. Certainly, the plurality of virtual object identifiers may alternatively be sent in another form. This is not specifically limited in this application. The third-party server sends a response message 1 to the application server. The response message 1 may include the virtual object list. The response message 1 may be of an HTTP message type. For example, the response message 1 may be an HTTP response (resp) message 1. For example, the response message 1 includes a session identifier.

1605: The application server forwards the response message 1 to the IMS AGW/MRF. The application server determines the identifier of the terminal device based on the session identifier in the response message 1, and adds the identifier of the first terminal device to the response message 1 forwarded to the IMS AGW/MRF.

1606: The IMS AGW/MRF obtains the virtual object list from the response message 1 from the application server, and sends the virtual object list to the first terminal device through the auxiliary data channel. Optionally, after obtaining the virtual object list, the IMS AGW/MRF stores the virtual object list. Subsequently, when receiving virtual object information corresponding to the virtual object list through the auxiliary data channel, the IMS AGW/MRF may no longer obtain the virtual object list from the third-party server. It may be understood that, in step 1602, when it is determined that the IMS AGW/MRF stores the virtual object list corresponding to the virtual object information, the IMS AGW/MRF may no longer perform the subsequent step of obtaining the virtual object list.

1607: The first terminal device sends a first virtual object identifier to the IMS AGW/MRF through the auxiliary data channel. The first virtual object identifier is one identifier in the virtual object list.

1608: When receiving the first virtual object identifier, the IMS AGW/MRF sends a request message 2 to the application server, where the request message 2 includes the first virtual object identifier. For example, the request message 2 may be of an HTTP message type. For example, the request message 2 may be an HTTP request (req) message 2.

1609: The application server receives the request message 2, and forwards the request message 2 to the third-party server.

1610: The third-party server receives the request message 2, and obtains a first virtual object corresponding to the first virtual object identifier. The third-party server sends a response message 2 to the application server. The response message 2 may include the first virtual object. The response message 2 may be of an HTTP message type. For example, the response message 2 may be an HTTP response (resp) message 2.

1611: The application server forwards the response message 2 to the IMS AGW/MRF.

1612: The IMS AGW/MRF obtains the first virtual object from the response message 2, and sends the first virtual object to the first terminal device through the auxiliary data channel.
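
Steps 1602 to 1612 are described only as generic HTTP request/response pairs. The sketch below assumes illustrative URLs and JSON field names for request message 1/response message 1 (the virtual object list) and request message 2/response message 2 (the first virtual object); none of these endpoint details come from the application.

```python
import json
from urllib import parse, request

AS_BASE = "http://as.example.com"   # assumed application server address

def request_virtual_object_list(terminal_id, object_type):
    """Request message 1 (step 1602): ask for identifiers matching a virtual object type."""
    body = json.dumps({"terminal": terminal_id, "virtual_object_type": object_type}).encode()
    req = request.Request(f"{AS_BASE}/virtual-objects/query", data=body,
                          headers={"Content-Type": "application/json"})
    with request.urlopen(req, timeout=5) as resp:          # response message 1 carries the list
        return json.loads(resp.read())["virtual_object_list"]

def request_virtual_object(terminal_id, object_id):
    """Request message 2 (step 1608): fetch the first virtual object by its identifier."""
    url = f"{AS_BASE}/virtual-objects/{parse.quote(object_id)}?terminal={parse.quote(terminal_id)}"
    with request.urlopen(url, timeout=5) as resp:           # response message 2 carries the object
        return resp.read()                                  # e.g. a 3D asset blob
```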

Refer to FIG. 17. For example, an AR media processing network element directly obtains a virtual object from a third-party server. The AR media processing network element is an IMS AGW/MRF. The IMS AGW may be an IMS AGW to which a first terminal device or a second terminal device at a communication peer belongs. The MRF may be an MRF to which a first terminal device or a second terminal device at a communication peer belongs.

1701: The first terminal device sends, to the IMS AGW/MRF through an auxiliary data channel, information about a virtual object, for example, an identifier of the virtual object or a type of the virtual object.

1702: When receiving the information about the virtual object, the IMS AGW/MRF sends a request message 11 to the third-party server. The request message 11 may include the information about the virtual object.

1703: The third-party server receives the request message 11, and obtains, based on the type of the virtual object, a plurality of virtual object identifiers that meet the type of the virtual object. An example in which a virtual object list includes the plurality of virtual object identifiers is used. Certainly, the plurality of virtual object identifiers may alternatively be sent in another form. This is not specifically limited in this application. The third-party server sends a response message 11 to the IMS AGW/MRF. The response message 11 may include the virtual object list. The response message 11 may be of an HTTP message type. For example, the response message 11 may be an HTTP response (resp) message 11.

1704: The IMS AGW/MRF obtains the virtual object list from the response message 11, and sends the virtual object list to the first terminal device through the auxiliary data channel. Optionally, after obtaining the virtual object list, the IMS AGW/MRF stores the virtual object list. Subsequently, when receiving virtual object information corresponding to the virtual object list through the auxiliary data channel, the IMS AGW/MRF may no longer obtain the virtual object list from the third-party server. It may be understood that, in step 1702, when it is determined that the IMS AGW/MRF stores the virtual object list corresponding to the virtual object information, the IMS AGW/MRF may no longer perform the subsequent step of obtaining the virtual object list.

1705: The first terminal device sends a first virtual object identifier to the IMS AGW/MRF through the auxiliary data channel. The first virtual object identifier is one identifier in the virtual object list.

1706: When receiving the first virtual object identifier, the IMS AGW/MRF sends a request message 12 to the third-party server.

1707: The third-party server receives the request message 12, and obtains a first virtual object corresponding to the first virtual object identifier. The third-party server sends a response message 12 to the IMS AGW/MRF. The response message 12 may include the first virtual object. The response message 12 may be of an HTTP message type. For example, the response message 12 may be an HTTP response (resp) message 12.

1708: The IMS AGW/MRF obtains the first virtual object from the response message 12, and sends the first virtual object to the first terminal device through the auxiliary data channel.
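
Steps 1702 and 1704 allow the IMS AGW/MRF to keep a virtual object list it has already obtained, so that a repeated request carrying the same virtual object information does not reach the third-party server again. A minimal caching sketch, in which the cache key and the stubbed fetch function are assumptions:

```python
class VirtualObjectListCache:
    """Cache virtual object lists at the IMS AGW/MRF, keyed by the virtual object information."""

    def __init__(self, fetch_from_third_party):
        self._fetch = fetch_from_third_party   # callable(info) -> list of identifiers
        self._lists = {}

    def get_list(self, virtual_object_info):
        # Steps 1702/1704: only contact the third-party server when the list is not cached yet.
        if virtual_object_info not in self._lists:
            self._lists[virtual_object_info] = self._fetch(virtual_object_info)
        return self._lists[virtual_object_info]

# Usage with a stubbed third-party lookup:
cache = VirtualObjectListCache(lambda info: [f"{info}-obj-{i}" for i in range(3)])
print(cache.get_list("cartoon-hat"))   # fetched once
print(cache.get_list("cartoon-hat"))   # served from the cache
```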

With reference to specific embodiments, the following describes a media augmented processing procedure of obtaining and using a real background object. For details, refer to FIG. 18 and FIG. 19.

Refer to FIG. 18. For example, an AR media processing network element obtains a real background object from a third-party server via an AS. The AR media processing network element is an IMS AGW/MRF. The IMS AGW may be an IMS AGW to which a first terminal device or a second terminal device at a communication peer belongs. The MRF may be an MRF to which a first terminal device or a second terminal device at a communication peer belongs.

1801: The first terminal device sends, to the IMS AGW/MRF through an auxiliary data channel, information about the real background object, for example, an identifier of the real background object or a geographical location of the real background object.

1802: The first terminal device sends a media stream 1 generated by the first terminal device to the IMS AGW/MRF through a media stream channel.

1803: When receiving the information about the real background object, the IMS AGW/MRF sends a request message 21 to the application server, where the request message 21 includes the information about the real background object. The geographical location of the real background object is used as an example. For example, the request message 21 may be of an HTTP message type. Certainly, another message type may alternatively be used. This is not limited in this application. For example, the request message 21 may be an HTTP request (req) message 21.

1804: The application server receives the request message 21, and when determining that the first terminal device needs to request the real background object, forwards the request message 21 to the third-party server.

1805: The third-party server receives the request message 21, and obtains the real background object based on the geographical location of the real background object. The third-party server sends a response message 21 to the application server. The response message 21 may include a real background object list. The response message 21 may be of an HTTP message type. For example, the response message 21 may be an HTTP response (resp) message 21.

1806: The application server forwards the response message 21 to the IMS AGW/MRF.

1807: The IMS AGW/MRF obtains the real background object from the response message 21, and renders the real background object to the media stream 1 to obtain an AR media stream 2.

1808: The IMS AGW/MRF sends the AR media stream 2 to the first terminal device through the media stream channel.

Refer to FIG. 19. For example, an AR media processing network element directly obtains a real background object from a third-party server. The AR media processing network element is an IMS AGW/MRF. The IMS AGW may be an IMS AGW to which a first terminal device or a second terminal device at a communication peer belongs. The MRF may be an MRF to which a first terminal device or a second terminal device at a communication peer belongs.

1901: The first terminal device sends, to the IMS AGW/MRF through an auxiliary data channel, information about the real background object, for example, an identifier of the real background object or a geographical location of the real background object.

1902: The first terminal device sends a media stream 1 generated by the first terminal device to the IMS AGW/MRF through a media stream channel.

1903: When receiving the information about the real background object, the IMS AGW/MRF sends a request message 22 to the third-party server, where the request message 22 includes the information about the real background object. The geographical location of the real background object is used as an example. For example, the request message 22 may be of an HTTP message type. Certainly, another message type may alternatively be used. This is not limited in this application. For example, the request message 22 may be an HTTP request (req) message 22.

1904: The third-party server receives the request message 22, and obtains the real background object based on the geographical location of the real background object. The third-party server sends a response message 22 to the IMS AGW/MRF. The response message 22 may include a real background object list. The response message 22 may be of an HTTP message type. For example, the response message 22 may be an HTTP response (resp) message 22.

1905: The IMS AGW/MRF obtains the real background object from the response message 22, and renders the real background object to the media stream 1 to obtain an AR media stream 2.

1906: The IMS AGW/MRF sends the AR media stream 2 to the first terminal device through the media stream channel.
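
Steps 1807 and 1905 render the fetched real background object into the media stream 1. As a toy illustration only, the sketch below composites a foreground frame over a background frame represented as plain 2D lists; a real gateway would operate on decoded video frames.

```python
def composite_frame(background, foreground, transparent=0):
    """Place a foreground frame over a fetched real background (steps 1807/1905, simplified).

    Frames are plain 2D lists of pixel values; 'transparent' marks foreground pixels
    that should let the background show through.
    """
    out = []
    for bg_row, fg_row in zip(background, foreground):
        out.append([bg if fg == transparent else fg for bg, fg in zip(bg_row, fg_row)])
    return out

# 3x3 toy example: foreground pixel 7 is kept, transparent pixels are replaced by background.
background = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
foreground = [[0, 7, 0], [0, 7, 0], [0, 0, 0]]
print(composite_frame(background, foreground))   # [[1, 7, 1], [2, 7, 2], [3, 3, 3]]
```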

Based on a same inventive concept as the method embodiments, an embodiment of this application provides an apparatus, specifically configured to implement the method performed by the IMS core network element in the foregoing method embodiments. A structure of the apparatus is shown in FIG. 20, and includes a receiving unit 2001, a sending unit 2002, and a processing unit 2003.

The receiving unit 2001 is configured to receive a first request message from a first terminal device, where the first request message is for requesting for establishing of an auxiliary data channel, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and an AR media processing network element.

The processing unit 2003 is configured to control establishing of the auxiliary data channel between the AR media processing network element and the first terminal device.

In a possible implementation, the processing unit 2003 controls, via the receiving unit 2001 and the sending unit 2002, establishing of the auxiliary data channel between the AR media processing network element and the first terminal device.

The sending unit 2002 is configured to send a second request message to the AR media processing network element, where the second request message is for requesting for establishing of the auxiliary data channel between the AR media processing network element and the first terminal device.

The receiving unit 2001 is further configured to receive a second response message sent by the AR media processing network element, where the second response message indicates to the IMS core network element that the AR media processing network element agrees upon the establishing of the auxiliary data channel.

The sending unit 2002 is further configured to send a first response message to the first terminal device, where the first response message indicates to the first terminal device that the AR media processing network element is ready for establishing of the auxiliary data channel.
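
The signalling handled by the processing unit 2003 reduces to relaying the first request as a second request and the second response as a first response. The sketch below models that control sequence with placeholder message objects and callbacks; it is not the SIP/SDP signalling an IMS deployment would actually use.

```python
from dataclasses import dataclass

@dataclass
class Message:
    kind: str    # e.g. "first_request", "second_request", "second_response", "first_response"
    body: dict

class ImsCoreControl:
    """Processing-unit view of auxiliary data channel establishment (FIG. 20, simplified)."""

    def __init__(self, send_to_agw, send_to_terminal):
        self._to_agw = send_to_agw            # sending unit towards the AR media processing network element
        self._to_terminal = send_to_terminal  # sending unit towards the first terminal device

    def on_first_request(self, first_request):
        # Relay the terminal's first description parameter in a second request message.
        self._to_agw(Message("second_request", {"description": first_request.body["description"]}))

    def on_second_response(self, second_response):
        # The AR media processing network element agreed; answer the terminal with a first response message.
        self._to_terminal(Message("first_response", {"description": second_response.body["description"]}))

# Wiring with print-based stubs in place of real signalling:
core = ImsCoreControl(send_to_agw=print, send_to_terminal=print)
core.on_first_request(Message("first_request", {"description": "terminal data-channel offer"}))
core.on_second_response(Message("second_response", {"description": "gateway data-channel answer"}))
```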

In a possible implementation, the AR media processing network element is an IMS access gateway having an AR processing capability or a media resource function MRF.

In a possible implementation, the AR media processing network element is located in an IMS network to which the first terminal device belongs or is located in an IMS network to which the second terminal device belongs.

In a possible implementation, the first request message and the second request message carry a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel.

The first response message and the second response message carry a second description parameter that is of the AR media processing network element and that is for establishing the auxiliary data channel.

In a possible implementation, the sending unit 2002 is further configured to send a query request to the AR media processing network element, where the query request is for querying whether the AR media processing network element has an AR processing capability.

The receiving unit 2001 is further configured to receive a query response sent by the AR media processing network element, where the query response carries information indicating that the AR media processing network element has the AR processing capability.

In a possible implementation, the first request message is further for requesting for establishing of a media stream channel, where the media stream channel is for transmitting a media stream for transmitting content for AR communication between the first terminal device and the AR media processing network element.

The processing unit 2003 is further configured to control establishing of the media stream channel between the AR media processing network element and the first terminal device.

In a possible implementation, the processing unit 2003 controls, via the sending unit 2002 and the receiving unit 2001, establishing of the media stream channel between the AR media processing network element and the first terminal device.

The sending unit 2002 is further configured to send a second request message to the AR media processing network element, where the second request message is for requesting for establishing of the media stream channel between the AR media processing network element and the first terminal device.

The receiving unit 2001 is further configured to receive a second response message sent by the AR media processing network element, where the second response message indicates to the IMS core network element that the AR media processing network element agrees upon the establishing of the media stream channel.

The sending unit 2002 is further configured to send a first response message to the first terminal device, where the first response message indicates to the first terminal device that the AR media processing network element is ready for establishing of the media stream channel.

Based on a same inventive concept as the method embodiments, an embodiment of this application provides an apparatus, specifically configured to implement the method performed by the AR media processing network element in the foregoing method embodiments. A structure of the apparatus is shown in FIG. 21, and includes a receiving unit 2101, a sending unit 2102, and a processing unit 2103.

The receiving unit 2101 is configured to receive a second request message sent by an internet protocol multimedia subsystem IMS core network element, where the second request message is for requesting for establishing of an auxiliary data channel between the AR media processing network element and a first terminal device, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and the AR media processing network element.

The sending unit 2102 is configured to send a second response message to the IMS core network element, where the second response message indicates to the IMS core network element that the AR media processing network element agrees upon the establishing of the auxiliary data channel.

In a possible implementation, the receiving unit 2101 is specifically configured to receive AR auxiliary data from the first terminal device through the auxiliary data channel after the establishment of the auxiliary data channel is completed.

In a possible implementation, a media stream channel for transmitting content for AR communication is established between the AR media processing network element and the first terminal device.

In a possible implementation, the receiving unit 2101 is configured to receive a media stream for the AR communication from the first terminal device through the media stream channel.

The processing unit 2103 is configured to perform media augmented processing on the media stream for the AR communication based on the AR auxiliary data to obtain an AR media stream.

The sending unit 2102 is configured to send the AR media stream obtained through augmented processing to the first terminal device through the media stream channel.

In a possible implementation, the sending unit 2102 is further configured to send the AR media stream obtained through augmented processing to a second terminal device.

In a possible implementation, the AR auxiliary data includes one or more of operation data of a virtual model, annotation or tracking data of a target object, identification data of a facial expression or a body action, or pose data.
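
These kinds of AR auxiliary data can all be carried on the same auxiliary data channel as differently tagged payloads. The type tags and field names in the sketch below are illustrative assumptions, not a format defined by this application.

```python
from typing import Any, Dict

# Illustrative type tags for the kinds of AR auxiliary data named above.
AUX_DATA_KINDS = {
    "model_operation",   # operation data of a virtual model (move, rotate, scale)
    "annotation",        # annotation or tracking data of a target object
    "face_or_body",      # identification data of a facial expression or a body action
    "pose",              # pose data
}

def make_aux_data(kind: str, payload: Dict[str, Any]) -> Dict[str, Any]:
    """Wrap a payload with its kind tag before sending it on the auxiliary data channel."""
    if kind not in AUX_DATA_KINDS:
        raise ValueError(f"unknown AR auxiliary data kind: {kind}")
    return {"kind": kind, "payload": payload}

print(make_aux_data("pose", {"position": [0.0, 1.2, -0.5], "rotation": [0.0, 0.0, 0.0, 1.0]}))
```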

In a possible implementation, the AR media processing network element is an IMS access gateway having an AR processing capability or a media resource function MRF.

In a possible implementation, the AR media processing network element is located in an IMS network to which the first terminal device belongs or is located in an IMS network to which the second terminal device belongs.

In a possible implementation, the second request message carries a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel. The processing unit 2103 is specifically configured to determine, from the first description parameter, a second description parameter supported by the AR media processing network element, where the second response message carries the second description parameter.
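
Determining the second description parameter from the first description parameter amounts to keeping the offered attributes that the AR media processing network element supports. A minimal sketch, assuming the description parameters are modelled as simple attribute lists rather than real SDP:

```python
def select_supported(first_description, supported):
    """Keep only the offered auxiliary-data-channel attributes that the network element supports."""
    return [attribute for attribute in first_description if attribute in supported]

offered = ["reliable-transport", "binary-frames", "max-message-size:65536"]
supported_by_agw = {"reliable-transport", "max-message-size:65536"}
second_description = select_supported(offered, supported_by_agw)
print(second_description)   # carried back to the IMS core network element in the second response message
```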

Based on a same inventive concept as the method embodiments, an embodiment of this application provides an apparatus, specifically configured to implement the method performed by the terminal device in the foregoing method embodiments. A structure of the apparatus is shown in FIG. 22, and includes a receiving unit 2201, a sending unit 2202, and a processing unit 2203.

The sending unit 2202 sends a first request message to an IMS core network element, where the first request message is for requesting for establishing of an auxiliary data channel. The receiving unit 2201 receives a first response message sent by the IMS core network element, where the first response message indicates that the AR media processing network element completes the establishment of the auxiliary data channel, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and the AR media processing network element.

A media stream channel for transmitting content for the AR communication is established between the first terminal device and the AR media processing network element.

In a possible implementation, the sending unit 2202 sends a third request message to the IMS core network element, where the third request message is for requesting for establishing of the media stream channel. The receiving unit 2201 receives a third response message sent by the IMS core network element, where the third response message indicates to the first terminal device that the AR media processing network element is ready for establishing of the media stream channel.

In a possible implementation, the first request message is further for requesting for establishing of the media stream channel. The first response message further indicates that the AR media processing network element is ready for establishing of the media stream channel.

In a possible implementation, the sending unit 2202 sends augmented reality AR auxiliary data to the AR media processing network element through the auxiliary data channel, and sends a media stream to the AR media processing network element through the media stream channel, where the media stream is generated during AR communication between the first terminal device and a second terminal device, and the AR auxiliary data is used by the AR media processing network element to perform media augmented processing on the media stream for the AR communication. The receiving unit 2201 receives an AR media stream obtained through augmented processing from the AR media processing network element through the media stream channel.
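
From the terminal side, the behaviour of the sending unit 2202 and the receiving unit 2201 in this implementation can be summarized as the loop sketched below; the channel objects and their send/receive methods are placeholders for whatever transport a deployment actually uses.

```python
def run_ar_call_step(aux_channel, media_channel, capture_frame, next_aux_data, display):
    """One iteration of terminal-side AR communication (FIG. 22, simplified).

    aux_channel / media_channel: objects with send() and, for media, receive();
    capture_frame(): returns the locally captured media frame;
    next_aux_data(): returns AR auxiliary data (e.g. pose data) or None;
    display(): shows the AR media stream returned after augmented processing.
    """
    aux = next_aux_data()
    if aux is not None:
        aux_channel.send(aux)               # auxiliary data channel
    media_channel.send(capture_frame())     # media stream channel (uplink)
    display(media_channel.receive())        # AR media stream obtained through augmented processing
```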

In a possible implementation, the receiving unit 2201 receives, through the media stream channel, an AR media stream that is of the second terminal device and that is sent by the AR media processing network element.

In a possible implementation, the first request message carries a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel.

The first response message carries a second description parameter that is of the AR media processing network element and that is for establishing the auxiliary data channel.

In a possible implementation, the AR media processing network element is located in an IMS network to which the first terminal device belongs or is located in an IMS network to which the second terminal device belongs.

In embodiments of this application, division into the units is an example and is merely logical function division, and there may be another division manner in actual implementation. In addition, functional units in embodiments of this application may be integrated into one processor, may exist alone physically, or two or more units may be integrated into one unit. The foregoing integrated unit may be implemented in a form of hardware, or may be implemented in a form of a software functional unit.

When the integrated unit is implemented in a form of a software functional unit and sold or used as an independent product, the integrated unit may be stored in a computer-readable storage medium. Based on such an understanding, the technical solutions of this application essentially, or the part contributing to a conventional technology, or all or some of the technical solutions may be implemented in a form of a software product. The computer software product is stored in a storage medium and includes several instructions for instructing a terminal device (which may be a personal computer, a mobile phone, a network device, or the like) or a processor to perform all or some of the steps of the methods in embodiments of this application. The foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

In embodiments of this application, the AR media processing network element, the application server, the IMS core network element, and the terminal device may all be presented in a form of functional modules obtained through division in an integrated manner. The "module" herein may be an ASIC, a circuit, a processor executing one or more software or firmware programs, a memory, an integrated logic circuit, and/or another device that can provide the foregoing functions.

In a simple embodiment, the AR media processing network element, the application server, the IMS core network element, and the terminal device may be implemented by using a structure shown in FIG. 23.

The apparatus 2300 shown in FIG. 23 includes at least one processor 2301 and a communication interface 2302, and optionally, may further include a memory 2303.

In a possible implementation, when the AR media processing network element is implemented by using the structure shown in FIG. 23, the processor 2301 in FIG. 23 may invoke computer-executable instructions stored in the memory 2303, so that the AR media processing network element may perform the method performed by the AR media processing network element in any one of the foregoing method embodiments.

The processor 2301 may communicate with another device via the communication interface 2302. For example, the processor 2301 receives, via the communication interface 2302, a request message sent by an IMS core network element, and sends a response message to the IMS core network element.

Specifically, the memory 2303 stores computer-executable instructions for implementing functions of the sending unit, the receiving unit, and the processing unit in FIG. 21. All functions/implementation processes of the sending unit, the receiving unit, and the processing unit in FIG. 21 may be implemented by the processor 2301 in FIG. 23 by invoking the computer-executable instructions stored in the memory 2303 and in combination with the communication interface 2302.

In still another possible implementation, when the IMS core network element is implemented by using the structure shown in FIG. 23, the processor 2301 in FIG. 23 may invoke the computer-executable instructions stored in the memory 2303, so that the IMS core network element may perform the method performed by the IMS core network element (the IMS AGW 1, the IMS AGW 2, the MRF 1, or the MRF 2) in any one of the foregoing method embodiments.

The processor 2301 may communicate with another device via the communication interface 2302. For example, the processor 2301 receives, via the communication interface 2302, a message sent by the AR media processing network element, and sends a message to the AR media processing network element.

Specifically, the memory 2303 stores computer-executable instructions for implementing functions of the sending unit, the receiving unit, and the processing unit in FIG. 20. All functions/implementation processes of the sending unit, the receiving unit, and the processing unit in FIG. 20 may be implemented by the processor 2301 in FIG. 23 by invoking the computer-executable instructions stored in the memory 2303 and in combination with the communication interface 2302.

In still another possible implementation, when the terminal device is implemented by using the structure shown in FIG. 23, the processor 2301 in FIG. 23 may invoke computer-executable instructions stored in the memory 2303, so that the terminal device may perform the method performed by the first terminal device or the second terminal device in any one of the foregoing method embodiments.

The processor 2301 may communicate with another device via the communication interface 2302. For example, the processor 2301 receives, via the communication interface 2302, an augmented media stream and the like sent by the AR media processing network element, and sends AR auxiliary data, a media stream, and the like to the AR media processing network element.

Specifically, the memory 2303 stores computer-executable instructions for implementing functions of the sending unit, the receiving unit, and the processing unit in FIG. 22. All functions/implementation processes of the sending unit, the receiving unit, and the processing unit in FIG. 22 may be implemented by the processor 2301 in FIG. 23 by invoking the computer-executable instructions stored in the memory 2303 and in combination with the communication interface 2302.

A specific connection medium between the processor 2301 and the memory 2303 is not limited in this embodiment of this application. In this embodiment of this application, the memory 2303 is connected to the processor 2301 through a bus 2304 in the figure. The bus 2304 is represented by a thick line in the figure. A connection manner between other components is merely schematically described and is not limited thereto. The bus 2304 may be classified into an address bus, a data bus, a control bus, and the like. For ease of representation, only one thick line is used to represent the bus in FIG. 23, but this does not mean that there is only one bus or only one type of bus.

Based on the foregoing embodiments, an embodiment of this application further provides a computer storage medium. The storage medium stores a software program. When the software program is read and executed by one or more processors, the method performed by the AR media processing network element, the IMS core network element, the application server, or the terminal device provided in any one or more of the foregoing embodiments may be implemented. The computer storage medium may include any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory, a random access memory, a magnetic disk, or an optical disc.

Based on the foregoing embodiments, an embodiment of this application further provides a chip. The chip includes a processor, configured to implement a function of an AR media server, or a function of the terminal device or the application server, in any one or more of the foregoing embodiments. Optionally, the chip further includes a memory, and the memory is configured to store necessary program instructions and data that are executed by the processor. The chip may include a chip, or may include a chip and another discrete device.

It can be understood that "an embodiment", "an implementation", or "an example" mentioned in the entire specification means that particular features, structures, or characteristics related to embodiments are included in at least one embodiment of this application. Therefore, "in an embodiment", "in an implementation", or "in an example" appearing throughout the specification does not necessarily refer to a same embodiment. In addition, these particular features, structures, or characteristics may be combined in one or more embodiments in any proper manner. It can be understood that sequence numbers of the foregoing processes do not mean execution sequences in embodiments of this application. The execution sequences of the processes should be determined based on functions and internal logic of the processes, and should not constitute any limitation on the implementation processes of embodiments of this application.

In addition, the terms "system" and "network" are usually used interchangeably in this specification. The term "and/or" in this specification describes only an association relationship for describing associated objects and represents that three relationships may exist. For example, A and/or B may represent the following three cases: Only A exists, both A and B exist, and only B exists. In addition, the character "/" in this specification usually indicates an "or" relationship between the associated objects. The term "at least one" in this application means one or more, that is, includes one, two, three, or more; and the term "a plurality of" means two or more, that is, includes two, three, or more. In addition, it can be understood that in the descriptions of this application, the terms such as "first" and "second" are merely used for distinguishing and description, but cannot be understood as indicating or implying relative importance, or cannot be understood as indicating or implying a sequence. The term "at least one of the following items (pieces)" or a similar expression thereof indicates any combination of these items, including a single item (piece) or any combination of a plurality of items (pieces). For example, at least one item (piece) of a, b, or c may indicate: a, b, c, a and b, a and c, b and c, or a, b, and c, where a, b, and c may be singular or plural. It can be understood that in embodiments of this application, "B corresponding to A" indicates that B is associated with A, and B may be determined according to A. However, it can be further understood that determining B based on A does not mean that B is determined based only on A. B may alternatively be determined based on A and/or other information. In addition, the terms "include" and "have" in embodiments, claims, and accompanying drawings of this application are not exclusive. For example, a process, method, system, product, or device including a series of steps or modules is not limited to the enumerated steps or modules, and may further include a step or module that is not enumerated.

It may be understood that the processor in embodiments of this application may be a central processing unit (CPU), or may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA) or another programmable logic device, a transistor logic device, a hardware component, or any combination thereof. The general-purpose processor may be a microprocessor or any regular processor or the like.

The method steps in embodiments of this application may be implemented in a hardware manner, or may be implemented in a manner of executing software instructions by the processor. The software instructions may include a corresponding software module. The software module may be stored in a random access memory (RAM), a flash memory, a read-only memory (ROM), a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), a register, a hard disk, a removable hard disk, a CD-ROM, or any other form of storage medium well known in the art. For example, a storage medium is coupled to a processor, so that the processor can read information from the storage medium and write information into the storage medium. Certainly, the storage medium may be a component of the processor. The processor and the storage medium may be located in the ASIC. In addition, the ASIC may be located in a network device or a terminal device. Certainly, the processor and the storage medium may exist in the network device or the terminal device as discrete components.

All or some of the foregoing embodiments may be implemented by software, hardware, firmware, or any combination thereof. When software is used to implement the embodiments, all or some of the embodiments may be implemented in a form of a computer program product. The computer program product includes one or more computer programs or instructions. When the computer programs or the instructions are loaded and executed on a computer, the procedures or the functions according to embodiments of this application are all or partially executed. The computer may be a general-purpose computer, a dedicated computer, a computer network, or another programmable apparatus. The computer programs or the instructions may be stored in a computer-readable storage medium, or may be transmitted by using a computer-readable storage medium. The computer-readable storage medium may be any usable medium accessible by the computer, or a data storage device such as a server integrating one or more usable media. The usable medium may be a magnetic medium, for example, a floppy disk, a hard disk, or a magnetic tape; may be an optical medium, for example, a DVD; or may be a semiconductor medium, for example, a solid-state disk (SSD).

In various embodiments of this application, unless otherwise stated or there is a logic conflict, terms and/or descriptions in different embodiments are consistent and may be mutually referenced, and technical features in different embodiments may be combined based on an internal logical relationship thereof, to form a new embodiment.

What is claimed is:
1. An augmented reality AR communication method, comprising: receiving, by an internet protocol multimedia subsystem IMS core network element, a first request message from a first terminal device, wherein the first request message is for requesting for establishing of an auxiliary data channel, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and an AR media processing network element; and controlling, by the IMS core network element, establishing of the auxiliary data channel between the AR media processing network element and the first terminal device.
2. The method according to claim 1, wherein the controlling, by the IMS core network element, establishing of the auxiliary data channel between the AR media processing network element and the first terminal device comprises: sending, by the IMS core network element, a second request message to the AR media processing network element, wherein the second request message is for requesting for establishing of the auxiliary data channel between the AR media processing network element and the first terminal device; receiving, by the IMS core network element, a second response message sent by the AR media processing network element, wherein the second response message indicates to the IMS core network element that the AR media processing network element agrees upon the establishing of the auxiliary data channel; and sending, by the IMS core network element, a first response message to the first terminal device, wherein the first response message indicates to the first terminal device that the AR media processing network element is ready for establishing of the auxiliary data channel.
3. The method according to claim 1, wherein the AR media processing network element is an IMS access gateway having an AR processing capability or a media resource function MRF.
4. The method according to claim 1, wherein the AR media processing network element is located in an IMS network to which the first terminal device belongs or is located in an IMS network to which a second terminal device belongs.
5. The method according to claim 1, wherein the first request message and the second request message carry a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel; and the first response message and the second response message carry a second description parameter that is of the AR media processing network element and that is for establishing the auxiliary data channel.
6. The method according to claim 2, wherein the method further comprises: sending, by the IMS core network element, a query request to the AR media processing network element, wherein the query request is for querying whether the AR media processing network element has an AR processing capability; and receiving, by the IMS core network element, a query response sent by the AR media processing network element, wherein the query response carries information indicating that the AR media processing network element has the AR processing capability.
7. The method according to claim 1, wherein the method further comprises: controlling, by the IMS core network element, establishing of a media stream channel between the AR media processing network element and the first terminal device, wherein the media stream channel is for transmitting a media stream for transmitting content for AR communication between the first terminal device and the AR media processing network element.
8. The method according to claim 7, further comprising: receiving, by the IMS core network element, a third request message from the first terminal device, wherein the third request message is further for requesting for establishing of the media stream channel; and the controlling, by the IMS core network element, establishing of a media stream channel between the AR media processing network element and the first terminal device comprises: sending, by the IMS core network element, a fourth request message to the AR media processing network element, wherein the fourth request message is for requesting for establishing of the media stream channel between the AR media processing network element and the first terminal device; receiving, by the IMS core network element, a fourth response message sent by the AR media processing network element, wherein the fourth response message indicates to the IMS core network element that the AR media processing network element agrees upon the establishing of the media stream channel; and sending, by the IMS core network element, a third response message to the first terminal device, wherein the third response message indicates to the first terminal device that the AR media processing network element is ready for establishing of the media stream channel.
9. An augmented reality AR communication method, comprising: receiving, by an AR media processing network element, a second request message sent by an internet protocol multimedia subsystem IMS core network element, wherein the second request message is for requesting for establishing of an auxiliary data channel between the AR media processing network element and a first terminal device, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and the AR media processing network element; and sending, by the AR media processing network element, a second response message to the IMS core network element, wherein the second response message indicates to the IMS core network element that the AR media processing network element agrees upon the establishing of the auxiliary data channel.
10. The method according to claim 9, wherein the method further comprises: receiving, by the AR media processing network element, AR auxiliary data from the first terminal device through the auxiliary data channel after the establishment of the auxiliary data channel is completed.
11. The method according to claim 9, wherein the method further comprises: establishing a media stream channel between the AR media processing network element and the first terminal device, wherein the media stream channel is for transmitting content for AR communication.
12. The method according to claim 11, wherein the method further comprises: receiving, by the AR media processing network element, a media stream for the AR communication from the first terminal device through the media stream channel; performing, by the AR media processing network element, media augmented processing on the media stream for the AR communication based on the AR auxiliary data to obtain an AR media stream; and sending, by the AR media processing network element, the AR media stream obtained through augmented processing to the first terminal device through the media stream channel.
13. The method according to claim 12, wherein the method further comprises: sending, by the AR media processing network element, the AR media stream obtained through augmented processing to a second terminal device.
14. The method according to claim 10, wherein the AR auxiliary data comprises one or more of operation data of a virtual model, annotation or tracking data of a target object, identification data of a facial expression or a body action, or pose data.
15. The method according to claim 9, wherein the AR media processing network element is an IMS access gateway having an AR processing capability or a media resource function MRF.
16. The method according to claim 9, wherein the AR media processing network element is located in an IMS network to which the first terminal device belongs or is located in an IMS network to which the second terminal device belongs.
17. The method according to claim 9, wherein the second request message carries a first description parameter that is of the first terminal device and that is for establishing the auxiliary data channel, and the method further comprises: determining, by the AR media processing network element from the first description parameter, a second description parameter supported by the AR media processing network element, wherein the second response message carries the second description parameter.
18. An augmented reality AR communication method, comprising: sending, by a first terminal device, a first request message to an IMS core network element, wherein the first request message is for requesting for establishing of an auxiliary data channel; and receiving, by the first terminal device, a first response message sent by the IMS core network element, wherein the first response message indicates that the AR media processing network element completes the establishment of the auxiliary data channel, and the auxiliary data channel is for transmitting auxiliary data for AR communication between the first terminal device and the AR media processing network element.
19. The method according to claim 18, wherein the method further comprises: establishing, between the first terminal device and the AR media processing network element, a media stream channel for transmitting content for the AR communication.
20. The method according to claim 18, wherein the establishing, between the first terminal device and the AR media processing network element, a media stream channel for transmitting content for the AR communication comprises: sending, by the first terminal device, a third request message to the IMS core network element, wherein the third request message is for requesting for establishing of the media stream channel; and receiving, by the first terminal device, a third response message sent by the IMS core network element, wherein the third response message indicates to the first terminal device that the AR media processing network element is ready for establishing of the media stream channel.